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The  genetic  polymorphism  of  the  murine  major 
histocompatibility  complex  class  II  Ab  gene  is  generated  by 
the  slow  accumulation  of  point  mutations  over  long 
evolutionary  time  periods.   These  point  mutations,  which  are 
predominantly  located  in  the  antigen  binding  site, 
frequently  result  in  nonsynonymous  (amino  acid  replacement) 
mutations,  and  usually  change  the  biochemical  class  of  amino 
acid.   This  diversity  is  then  extensively  amplified  by 
mechanisms  that  dramatically  modify  the  antigen  binding  site 
in  a  single  step;  intra-exonic  recombination  between 
different  alleles  of  Ab  exon  2  (which  contains  the  Ab 
portion  of  the  antigen  binding  site) ,  and  the  introduction 
of  amino  acid  deletions.   Consequently,  natural  mouse 
populations  contain  an  array  of  Ab  alleles  with  highly 
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divergent  antigen  binding  site  structures  and  presumably 
antigen  binding  properties.   The  accumulation  of  such  rare 
and  unusual  genetic  events  specifically  within  the  antigen 
binding  site  of  the  Mhc  class  II  Ab  gene  suggests  that 
specialized  selective  mechanisms  may  favor  the  maintenance 
of  alleles  encoding  highly  divergent  forms  of  the  antigen 
binding  site.   This  type  of  selection,  referred  to  as 
divergent  allele  advantage,  may  act  in  concert  with  other 
forms  of  balancing  selection,  and  drive  the  diversification 
of  the  antigen  binding  site  by  selectively  maintaining  the 
most  divergent  Mhc  alleles  within  populations. 
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CHAPTER  I 
INTRODUCTION 


A  crucial  step  in  the  initiation  of  all  antigen- 
specific  immune  responses  is  T  lymphocyte  recognition  of 
processed  antigen  bound  to  molecules  encoded  by  the  major 
histocompatibility  complex  (Mhc) .   The  two  classes  of  Mhc 
molecules,  class  I  and  class  II,  bind  peptide  fragments  that 
are  derived  from  different  cellular  compartments,  and 
generated  by  various  antigen  processing  pathways.   This 
allows  T  lymphocytes  the  ability  to  efficiently  detect 
cellular  alterations  via  stimulation  of  their  clonally 
distributed  T  cell  antigen  receptor.   Regulatory  T 
lymphocytes  normally  recognize  antigen  bound  to  class  II 
molecules,  while  cytotoxic  T  lymphocytes  normally  recognize 
antigen  complexed  to  class  I  molecules.   Mhc  molecules 
specifically  bind  antigenic  fragments  in  their  antigen 
binding  site,  which  X-ray  crystallographic  analysis  of  a 
class  I  molecule  has  shown  to  be  a  groove  produced  by  two 
parallel  a-helices  overlaying  a  platform  composed  of  an 
eight  strand  /3-pleated  sheet  (Bjorkman  et  al.  1987a;  Brown 
et  al.  1988) .   Fragments  of  processed  antigen,  approximately 
9-17  amino  acids  in  length,  are  bound  with  micromolar 
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affinity  within  the  antigen  binding  site  groove  (Buus  et  al. 
1987)  . 

The  importance  of  Mhc  gene  products  to  immune 
recognition  has  dramatically  influenced  their  evolution, 
such  that  these  genes  exhibit  an  unparalleled  degree  of 
polymorphism.   Alleles  often  differ  in  greater  than  10%  of 
their  nucleotide  sequence,  and  most  of  this  diversity  is 
concentrated  in  the  exons  encoding  the  antigen  binding  site 
(Benoist  et  al.  1983;  Estess  et  al.  1986).   These 
polymorphisms  modify  the  functional  properties  of  Mhc 
molecules  in  antigen  presentation,  causing  changes  in  the 
immune  responsiveness  of  individuals  to  foreign  antigens. 

The  evolutionary  mechanisms  responsible  for  the 
generation  and  maintenance  of  Mhc  diversity  has  been  a 
controversial  issue  in  the  field  of  immunogenetics  for  many 
years.   The  goals  of  this  dissertation  are  to  elucidate  the 
molecular  mechanisms  responsible  for  this  unprecedented 
genetic  diversity,  and  to  determine  whether  selective 
pressures  are  acting  to  exclusively  diversify  the  antigen 
binding  site.   These  issues  were  approached  by  analyzing  the 
divergence  of  the  antigen  binding  site  in  the  murine  Mhc 
class  II  Ab  gene.   The  polymerase  chain  reaction  coupled 
with  DNA  sequencing  technology  was  employed  to  obtain  the 
nucleotide  sequence  of  46  alleles  of  Ab  exon  2  (which 
encodes  the  Ab  portion  of  the  antigen  binding  site) . 
Together  with  ten  published  sequences,  the  data  set  included 
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nucleotide  sequence  from  56  alleles  derived  from  12  Mus 
species  and  2   Rattus   species. 

The  results  of  this  analysis  indicate  that  the 
diversification  of  Ab  exon  2  is  generated  by  the  slow 
accumulation  of  point  mutations  predominantly  in  the  antigen 
binding  site  over  long  evolutionary  time  spans.   These  point 
mutations  frequently  result  in  nonsynonymous  (amino  acid 
replacement)  mutations,  and  often  change  the  biochemical 
class  of  amino  acid  at  that  position.   This  diversity  is 
then  extensively  amplified  by  mechanisms  that  dramatically 
modify  the  antigen  binding  site  in  a  single  step;  intra- 
exonic  recombination  between  Ab  alleles,  and  the 
introduction  of  amino  acid  deletions.   As  a  result,  natural 
mouse  populations  contain  an  array  of  alleles  with  highly 
divergent  binding  properties.   The  accumulation  of  such  rare 
and  unusual  events  specifically  within  the  antigen  binding 
site  of  Mhc  class  II  genes  suggests  that  specialized 
selective  mechanisms  may  favor  the  maintenance  of  alleles 
encoding  highly  divergent  forms  of  the  antigen  binding  site. 
This  type  of  selection,  referred  to  as  divergent  allele 
advantage,  may  act  in  concert  with  the  other  two  forms  of 
balancing  selection  (overdominance  and  rare  allele 
advantage)  and  drive  the  diversification  of  the  antigen 
binding  site  by  selectively  maintaining  the  most  divergent 
Mhc  alleles  within  populations. 


CHAPTER  II 
REVIEW  OF  THE  LITERATURE 


The  major  histocompatibility  complex  (Mhc)  was 
initially  detected  based  on  its  involvement  in  graft 
rejection  between  different  inbred  mouse  lines  (Little  and 
Tyzzer  1916) .   The  development  of  serological  techniques 
immensely  augmented  the  study  of  the  H-2  complex  (Gorer 
1936;  1938),  and  thus  allowed  for  a  greater  understanding 
and  appreciation  of  the  genetic  diversity  encoded  within  the 
genes  of  the  Mhc.   The  extensive  polymorphism  of  the  Mhc 
genes,  as  well  as  their  critical  role  in  regulating  immune 
responsiveness  has  made  their  study  particularly  interesting 
for  scientists  from  many  different  disciplines  of  biology. 
This  literature  review  will  concentrate  on  Mhc  class  II  gene 
structure,  function,  and  genetic  diversity. 

Genomic  Organization  of  the  Major  Histocompatibility  Complex 

The  murine  major  histocompatibility  complex,  referred 
to  as  H-2 .  is  a  large  multigene  family  located  on  chromosome 
17  of  the  mouse.   The  H-2  complex  encompasses  approximately 


2  centiMorgans  of  DNA,  which  is  equivalent  to  a  physical 
distance  of  at  least  2.4  megabase  pairs  (Hood  et  al.  1982; 
Klein  1975) .   Chromosomal  walking  and  mapping  techniques 
have  provided  a  detailed  picture  of  the  molecular 
organization  of  the  Mhc.   A  total  of  50  genes  have  been 
cloned  and  partially  characterized  from  the  Mhc  of  a  BALB/c 
mouse,  most  of  which  encode  immune  related  proteins.   A 
molecular  map  of  the  H-2  complex,  as  well  as  the  general 
protein  structure,  is  illustrated  in  Figure  2-1. 

Mhc  genes  encode  three  families  of  proteins  based  on 
their  structure  and  function.   The  H-2  complex  has 
accordingly  been  divided  into  four  regions  which  correspond 
to  the  class  of  molecules  encoded.   The  K  and  D  regions 
contain  the  class  I  genes,  of  which  there  are  two  general 
types.   First,  the  K,  p,  and  L  genes  encode  for  the 
classical  transplantation  antigens  and  are  expressed  on  most 
nucleated  cells  of  the  body.   These  molecules  are  extremely 
polymorphic,  and  as  such  are  responsible  for  mediating 
heterologous  graft  rejection.   Their  physiological  function 
in  vivo,  however,  is  to  present  viral  and  tumor  antigens  to 
cytotoxic  T  lymphocytes  (Zinkernagel  1979) .   The  second  type 
of  class  I  gene,  designated  Qa   and  Tla,  are  much  less 
polymorphic.   Molecular  cloning  studies  have  revealed  more 
than  32  genes  of  this  type  (Steinmetz  et  al.  1982a) ,  located 
telomeric  to  the  D  and  L  loci  (Winoto  et  aJL.  1983)  . 
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Although  some  of  these  genes  are  expressed  on  nucleated 
blood  cells  (Qa)    or  on  thymocytes  and  certain  leukemias 
(Tla)  (Michealson  et  al.  1983),  their  function  is  not  yet 
known  (Flaherty  1980) .   It  has  been  postulated,  however, 
that  Tla  molecules  may  serve  as  restriction  elements  for  the 

S   lineage  of  T  lymphocytes  (Janeway  et  al.  1989) . 

Both  types  of  class  I  molecules  have  a  similar  protein 
structure.   They  consist  of  three  extracellular  domains,  a 
transmembrane  domain,  and  a  cytoplasmic  domain,  thereby 
constituting  a  40-45,000  dalton  membrane  bound  glycoprotein 
of  approximately  350  amino  acids.   A  fourth  extracellular 
domain  is  contributed  by  /3-2  microglobulin.   This  12,000 
dalton  molecule,  encoded  on  chromosome  2  of  the  mouse, 
noncovalently  associates  with  class  I  proteins,  and  is 
thought  to  play  a  role  in  stabilizing  the  extracellular 
domain  structure  of  class  I  proteins  (Klein  et  al.  1983b) . 

The  S  region  encodes  a  heterogeneous  assortment  of 
genes.   Included  are  the  classical  class  III  genes,  which 
encode  the  complement  proteins  C2,  Bf ,  Sip,  and  C4,  as  well 
as  the  two  homologous  genes  21-OHA  and  21-OHB.  one  of  which 
codes  for  the  steroid  21-hydroxylase  (Steinmetz  et  al. 
1984).   Centromeric  to  the  D  locus  are  the  genes  for  two 
cytotoxins,  TNF-a  and  TNF-/3  (Muller  et  al.  1987).   Although 
these  S  region  genes  are  physically  located  within  the  H-2 
complex,  and  may  therefore  evolve  as  a  single  genetic  unit 

(Bodmer  1976),  Klein  et  al.  (1983b)  argue  against  their 
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inclusion  in  the  Mhc  because  they  are  not  functionally 
related  to  the  class  I  or  class  II  histocompatibility  loci. 

The  class  II  genes  are  contained  within  the  I  region, 
which  maps  between  the  K  and  S  regions  of  the  Hrl  complex 
(Figure  2-1) .   Class  II  genes  were  first  defined  by  the 
differential  ability  of  inbred  mouse  strains  to  mount  an 
immune  response  to  the  synthetic  peptide  (T,G)-A — L 
(McDevitt  and  Sela  1965) .   These  immune  response  genes  were 
then  definitively  mapped  using  recombinant  and  congenic 
strains  of  mice  (McDevitt  et  al.  1972;  Benacerraf  and 
McDevitt  1972) .   There  are  two  isotypic  forms  of  class  II 
molecules  encoded  within  the  I  region,  denoted  A  and  E,  and 
these  are  assembled  from  polypeptides  encoded  by  the  four 
functional  genes  Ab,  Aa,  Eb,  and  Ea.   In  addition,  there  are 
three  pseudogenes,  termed  Ab3,  Ab2,  and  Eb2  (Widera  and 
Flavell  1985;  Steinmetz  et  al.  1986).   Class  II  molecules 
are  heterodimeric  glycoproteins  composed  of  a  35,000  dalton 
a  (alpha)  chain  and  a  28,000  dalton  b  (beta)  chain  (Klein  et 
al.  1983b) .   These  polypeptides  noncovalently  associate  in 
the  cytoplasm  and  are  subsequently  expressed  on  the  surface 
of  antigen  presenting  cells.   Both  the  a  and  b  chains  are 
organized  into  5  protein  domains  including  a  hydrophobic 
leader  peptide  of  approximately  25  amino  acids  absent  in  the 
mature  cell  surface  form  of  the  molecule,  2  approximately  90 
amino  acid  extra-cellular  domains  (termed  al  a2  or  bl  b2) ,  a 
hydrophobic  transmembrane  segment  of  25  amino  acids,  and  a 
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10 
highly  charged  cytoplasmic  domain.   The  tertiary  structure 
of  the  a2,  bl,  and  b2  domains  is  abetted  by  the  formation  of 
disulfide  bonds  between  pairs  of  cysteine  residues  located 
within  each  domain  (Mengle-Gaw  and  McDevitt  1985) . 

The  domain  organization  of  class  II  polypeptides 
directly  reflects  the  exon/intron  organization  of  their 
respective  genes.   The  b  chain  genes,  for  instance,  are 
composed  of  six  exons,  one  for  each  protein  domain,  and  an 
additional  exon  encoding  the  3'  untranslated  region  (Saito 
et  al.  1983).   The  a  chain  genes  are  very  similar,  except 
that  the  transmembrane  and  cytoplasmic  domains  are  combined 
into  a  single  exon.   Thus,  they  are  composed  of  5  exons 
(Mathis  et  al.  1983;  McNichols  et  al.  1982).   A  diagram  of 
the  organization  of  class  II  a  and  b  genes  is  given  in 
Figure  2-2. 

Organization  of  the  I-Reqion 

The  I  region  was  originally  divided  into  5  sub-regions, 
I-A,  I-B,  I-J,  I-E,  and  I-C  based  on  recombinational 
analysis  of  various  immune  responsiveness  traits  (reviewed 
by  Klein  1975) .   The  I-A  and  I-E  subregions  encode  the 
serologically  and  biochemically  defined  A  and  E  molecules, 
which  are  the  immune  response  antigens.   The  Ab,  Aa,  and  Eb 
polypeptides  are  encoded  in  the  I-A  subregion  and  the  Ea 
chain  is  encoded  by  the  I-E  subregion  (Jones  et  al.  1978; 
Murphy  et  al.  1980) . 
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The  I-J  subregion  was  serologically  defined  by  reagents 

directed  against  an  I-J  polypeptide,  which  was  thought  to  be 

a  suppressor  T  cell  factor  capable  of  suppressing  immune 

responses  (Murphy  et  al.  1976;  Murphy  et  al.  1980). 

Although  these  I-J  suppressor  factors  have  been 

serologically  defined,  attempts  to  isolate  and  biochemically 

characterize  them  have  failed. 

The  existence  of  the  I-B  and  I-C  subregions  were  based 

entirely  on  regulatory  effects  on  immune  responsiveness. 

The  I-B  subregion  was  originally  defined  by  Leiberman  et  al. 
(1972)  for  its  ability  to  regulate  the  antibody  response  to 

an  allotypic  determinant  on  the  myeloma  protein  MOPC  173. 
Immune  responses  to  at  least  5  other  antigens  have  been 

attributed  to  the  B  region  including  lactate  dehydrogenase  B 
(Melchers  et  al.  1973),  staphylococcal  nuclease  (Lozner  et 
al.  1974),  oxazolone  (Fachet  and  Ando  1977),  H-Y  antigen 
(Hurme  et  al.  1978),  and  trinitrophenylated  mouse  serum 
albumin  (Urba  and  Hildemann  1978).   No  protein  product  has 
ever  been  detected  from  the  I-B  subregion,  and  its  effects 
can  be  explained  by  the  complimentation  of  gene  products 
from  both  the  I-A  and  I-E  subregions  (Dorf  and  Benacerraf 
1975;  Klein  et  al.  1981). 

The  C  locus  was  first  discovered  with  H-2  anti-H-2 
antiserum  (David  and  Shreffler  1974) .   Rich  et  al.  (1979a, 
1979b)  subseguently  produced  antisera  containing  C  specific 
antibodies  that  reacted  with  a  suppressor  factor  produced  in 
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allogeneic  mixed  lymphocyte  reactions.   Mapping  of  the  C 
subregion  using  recombinant  inbred  strains  suggested  a 
position  telomeric  to  the  Ea  locus  and  centromeric  to  C4. 
As  for  the  B  subregion,  no  protein  product  has  ever  been 

found . 

The  advent  of  molecular  genetic  analysis  has  allowed 
the  elucidation  of  a  molecular  map  of  the  I-region. 
Steinmetz  et  al.  (1982)  provided  the  first  evidence  at  the 
molecular  level  of  the  exact  linkage  of  class  II  genes  by 
cloning  a  stretch  of  200,000  contiguous  base  pairs  from  the 
I-region  of  a  BALB/c  mouse.   This  study  identified  3  of  the 
biochemically  defined  class  II  genes,  Ab,  Eb,  and  Ea?  and 
Eb2,  designated  a  pseudogene  because  it  did  not  hybridize  to 
a  5'  probe.   Southern  blot  analysis  of  the  BALB/c  genome 
suggested  that  the  I-region  contains  2  a  chain  genes  and 
from  4  to  six  b  chain  genes,  a  conclusion  later  confirmed  by 
the  work  of  Widera  and  Flavell  (1985).   Steinmetz  et  al. 
(1982)  also  showed  that  the  Ea  and  Eb  genes  are  in  fact 
present  in  strains  not  expressing  an  E  molecule.   Thus,  the 
failure  to  express  an  E  molecule  is  not  a  consequence  of  the 
deletion  of  the  entire  gene,  but  rather  must  occur  at  the 
level  of  transcription  or  translation.   Subsequent 
experiments  involving  the  screening  of  cosmid  libraries  by 
Davis  et  al.  (1984)  lead  to  the  identification  of  the  Aa 
gene,  and  it  was  mapped  just  telomeric  to  the  Ab  locus. 
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Comparison  of  the  molecular  map  of  the  I-region  with 
the  genetic  map  has  confirmed  the  location  of  the  Aa  and  Ab 
genes  in  the  I-A  subregion,  and  the  location  of  the  Ea  gene 
in  the  I-E  subregion.   The  Eb  gene,  however,  is  located  with 
its  5 '  end  in  the  I-A  subregion  and  its  3 •  end  in  the  I-E 
subregion.   This  confines  the  I-J  and  I-B  subregions  to  less 
than  3.4  Kb  of  DNA  at  the  3'  end  of  the  Eb  gene  (Steinmetz 
et  al.  1982).   Sequence  analysis  of  this  DNA  fragment 
definitively  showed  that  no  gene  could  encode  for  I-J  in 
this  segment  (Kobori  et  al.  1986) . 

Two  other  class  II  genes  have  subsequently  been 
discovered  and  determined  to  be  pseudogenes.   Larhammer  et 
al.  (1983a)  identified  the  Ab2  gene  and  mapped  it 
approximately  20  Kb  centromeric  to  the  Ab  gene.   Sequence 
analysis  of  the  Ab2  gene  and  an  Ab2  cDNA  clone  shows  the 
exon/intron  organization  to  be  the  same  as  other  class  II  b 
genes  (Larhammer  et  al.  1983b) .   The  predicted  amino  acid 
sequence  of  Ab2  shows  only  about  60%  homology  to  other  b 
chains.   In  contrast,  the  typical  homology  among  other  b 
chains  in  human  and  mouse  is  around  80%,  thus  indicating 
that  Ab2  is  the  most  divergent  member  of  the  family. 
Detection  of  incompletely  spliced  Ab2  mRNA  and  the  finding 
of  an  cDNA  clone  containing  intron  sequences  suggests  that 
Ab2  transcripts  are  not  properly  processed.   No  cell  surface 
product  has  been  isolated  from  the  Ab2  locus.   Hybridization 
to  restriction  enzyme-digested  genomic  DNA  of  different 
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inbred  strains  with  Ab2  probes  indicated  that  this  gene 
displays  a  lesser  degree  of  polymorphism  than  Ab. 

Widera  and  Flavell  (1985)  isolated  the  Ab3  gene.   It 
shows  83%  nucleotide  homology  with  the  human  SBb  gene  and 
strong  homology  with  other  class  II  b  genes.   However,  an  8 
nucleotide  deletion  makes  the  translation  of  this  gene  into 
a  functional  protein  an  impossibility. 

The  position  of  the  Ab3  gene  is  75  Kb  telomeric  the  K 
gene  (Widera  and  Flavell  1985) .   Steinmetz  et  al.  (1986) 
subsequently  linked  the  Ab3  gene  from  the  BALB/c  mouse  to 
the  rest  of  the  I-region,  thereby  providing  a  contiguous  600 
kilobase  map  of  the  K  and  I  regions  of  the  Mhc.   The 
organization  of  the  I-region  is  shown  in  Figure  2-3.   The 
genes  are  arranged  centromerically  in  the  order  of  Ab3 .  Ab_2, 
Ab,  Aa,  Eb,  Eb2,  and  Ea./  and   span  approximately  300 
kilobases  of  DNA,  with  the  functional  genes  confined  to  a 
110  kilobase  region. 

Homologous  Recombination  Within  the  Mhc 

The  molecular  cloning  and  characterization  of  large 
segments  of  the  Mhc  has  made  it  possible  to  map  meiotic 
recombinational  breakpoints  at  the  nucleotide  level. 
Steinmetz  et  al.  (1982a)  initially  analyzed  9  intra-I  region 
recombinant  mouse  strains  and  found  that  all  the 
recombinational  events  map  to  a  10  kilobase  segment  of  DNA 
covering  part  of  the  Eb  gene.   Subsequent  southern  blot  and 
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sequence  analysis  revealed  that  3  of  the  recombinational 
events  occurred  within  a  1  kilobase  region  of  DNA  in  the 
intron  between  the  Ebl  and  Eb2  exons  (Kobori  et  al.  1984; 
1986) .   Several  succeeding  studies  have  identified  3  more 
intra-I  region  recombinants  in  which  the  breakpoints  map  to 
this  recombinational  hotspot  (Saha  and  Cullen  1986a;  1986b; 
Lafuse  and  David  1986) .   The  finding  of  highly  localized 
meiotic  recombination  points  in  the  mouse  Mhc  indicates  that 
recombination  is  highly  focal,  and  the  genetic  and  physical 
maps  would  not  be  congruent. 

Further  studies  have  revealed  4  additional 
recombinational  hotspots  within  the  Mhc.   These  map  to  (1)  a 
40  kilobase  stretch  of  DNA  between  the  K  and  A  loci 
(Steinmetz  et  al.  1986;  Shiroshi  et  al.  1982).   (2)  A  9.5 
kilobase  region  of  DNA  between  the  Ab3  and  Ab2  loci 
(Steinmetz  et  al.  1986) .   Further  analysis  of  this 
recombinational  hotspot  by  Uematsu  et  al.  (1986)  revealed 
that  all  the  breakpoints  were  confined  to  a  3.5  kilobase 
stretch  of  DNA.   All  the  breakpoints  examined  showed 
homologous  recombination  without  any  DNA  sequences 
duplicated  or  deleted  between  the  parental  and  recombinant 
haplotypes.   (3)  Seven  breakpoints  have  been  characterized 
mapping  to  a  12-14  kilobase  region  centromeric  to  the  Ea 
gene  (Lafuse  and  David  1986) .   (4)  Another  recombinational 
hotspot  was  identified  by  Tarnuzzer  (1988),  that  maps  to  a 
4.7  kilobase  stretch  of  DNA  approximately 
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5  kilobases  telomeric  to  the  Aa  gene.   These  observations 
indicate  that  most  of  the  recombinations  within  the  H-2 
complex  occur  in  clusters,  defining  5  recombinational 
hotspots  shown  in  Figure  2-4. 

All  the  recombinational  hotspots  in  H-2  have  three 
characteristics  in  common:  (1)  high  freguency  of  homologous 
recombination,  (2)  localization  to  a  small  stretch  of  DNA, 
and  (3)  haplotype  specificity  (Steinmetz  et  a_l.  1987). 
Furthermore,  when  the  recombinational  hotspots  are  present, 
they  act  in  a  dominant  fashion  (Steinmetz  and  Uematsu  1987) . 

The  structural  basis  of  the  recombinational  hotspots 
within  the  Mhc  is  unknown  (Steinmetz  et  al.  1987) . 
Repetitive  seguences  have  been  identified  in  the  proximity 
of  the  Ab3/Ab2  and  Eb  hotspots  (Steinmetz  et  al.  1987; 
Uematsu  et  al.  1986) .   These  repetitive  seguences  have  been 
suggested  to  play  a  role  based  on  their  similarity  to  Chi,  a 
recombinational  hotspot  in  phage  lambda,  and  human 
hypervariable  minisatellite  seguences,  constituting  presumed 
hotspots  in  man.   These  similarities  may  therefore  indicate 
that  the  basic  mechanism  of  homologous  recombination  maybe 
similar  in  prokaryotes  and  eukaryotes. 

3-Dimensional  Structure  of  Mhc  Molecules 

A  major  advance  in  the  understanding  of  Mhc  molecules 
came  with  the  elucidation  of  the  3-dimensional  structure  of 
the  class  I  HLA-A2  molecule  (Bjorkman  et  al.  1987a).   Plasma 
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membranes  from  the  homozygous  human  lymphoblastoid  cell  line 
JY  were  digested  with  papain  to  remove  the  transmembrane 
anchor  of  the  HLA-A2  molecule.   The  soluble  fragment, 
containing  the  al,  a2,  a3,  and  /32M  domains,  was 
crystallized,  and  the  structure  was  then  determined  from 
3.5A"  X-ray  crystal lographic  analysis.   The  molecule  is 
comprised  of  two  structurally  similar  domains;  al  and  a2 
have  the  same  tertiary  folds,  and  likewise  a3  and  /?2M  have 
the  same  tertiary  folds.   The  a3  and  /32M  domains  are  both  0- 
sandwich  structures  composed  of  2  antiparallel  ^-pleated 
sheets,  one  with  4  /3-strands  and  one  with  3  /3-strands. 
These  two  sheets  are  connected  by  a  disulfide  bond.   This 
tertiary  structure  has  been  described  for  the  constant 
region  of  immunoglobulin  molecules,  and  is  consistent  with 
the  amino  acid  homology  between  the  2  molecules  (Orr  et  al. 
1979)  . 

The  al  and  a2  domains  interact  symmetrically  to  compose 
the  antigen  binding  site  (Bjorkman  et  al.  1987a) .   It  is 
located  on  the  top  surface  of  the  molecule,  distal  from  the 
membrane,  in  a  position  accessible  for  recognition  by 
receptors  from  the  surface  of  another  cell.   The  structure 
consists  of  two  parallel  a-helices,  each  span  a  platform 
composed  of  an  8  strand  antiparallel  /3-pleated  sheet 
structure.   The  antigen  binding  site  is  the  groove  that  lies 
between  the  two  a-helices  and  atop  the  /3-pleated  sheet.   The 
dimensions  of  the  antigen  binding  site  (ABS)  groove  are 
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approximately  25A°  long,  10A"  wide,  and  11A°  deep.   This 
would  accommodate  an  8-10  amino  acid  fragment  in  a  linear 
conformation  or  14-17  amino  acids  in  an  a-helical 
confirmation.   The  interior  of  the  antigen  binding  site  is 
lined  with  both  polar  and  nonpolar  amino  acid  side  chains, 
and  many  of  the  highly  polymorphic  amino  acids  responsible 
for  haplotype-specific  associations  with  antigen  are  located 
in  the  site  (Bjorkman  et  al.  1987b) . 

A  large  continuous  region  of  electron  density  that  is 
not  accounted  for  by  the  polypeptide  chain  of  the  HLA-A2 
molecule  was  observed  in  the  ABS  (Bjorkman  et  al.  1987a) . 
It  seems  likely  that  this  extra  density  is  from  a  peptide  or 
mixture  of  peptides  that  co-crystallized  with  the  Mhc 
molecule. 

Recently,  the  structure  of  HLA-AW68,  refined  to  a 
resolution  of  2.6A°,  was  reported  (Garrett  et  al.  1989). 
The  backbone  structure  of  the  two  HLA  class  I  molecules  was 
very  similar,  excluding  the  13  amino  acid  differences,  of 
which  10  are  in  amino  acid  positions  that  face  in  the 
antigen  binding  cleft.   These  amino  acid  differences 
individually  cause  only  local  structural  changes,  but 
overall  substantially  transform  the  ABS.   For  instance, 
comparison  of  the  structure  from  the  2  alleles  illustrates 
that  various  sub-sites  of  the  groove  have  contour  and 
charge-distribution  changes.   Furthermore,  the  physical 
characteristics  of  pockets  which  extend  between  the  a-helix 
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and  0-pleated  sheets,  and  are  thought  to  play  a  critical 
role  in  determining  peptide  binding  properties,  can  be 
highly  diversified  between  the  alleles.   This  is  due  to 
polymorphisms  that  result  in  amino  acid  side  chain  changes, 
differences  that  ultimately  dictate  physical  binding 
properties.   The  number  of  amino  acid  differences  between 
HLA-A2  and  HLA-Aw68  is  approximately  equal  to  the  average 
number  of  site  differences  between  pairs  of  HLA  alleles. 
Therefore,  the  same  degree  of  structural  changes  in  local 
pockets  and  sub-sites  should  be  observed  in  other  alleles. 

A  model  of  the  Mhc  class  II  ABS  has  been  proposed  based 
on  the  class  I  structure  and  the  pattern  of  polymorphism  of 
human  and  mouse  class  II  alleles  (Brown  et  al.  1988) .   The 
basic  3-dimensional  structure  is  the  same;  the  two  a-helices 
lying  atop  an  8  strand  /3-pleated  sheet  (Figure  2-5)  .   Both 
the  a  chain  and  the  b  chain  contribute  an  a-helix  and  4 
strands  of  the  /3-pleated  sheet.   There  are  regions  of  the 
model,  however,  whose  tertiary  structure  cannot  be 
accurately  predicted.   These  areas  in  the  b  chain  are  (1) 
the  loop  between  /3-strand  one  and  /3-strand  two,  (2)  the 
central  a-helix,  and  (3)  the  3'  segment.   The  undefined 
parts  of  the  a  chain  are  (1,2)  the  loops  between  /3-strand  1 
and  2,  and  between  /3-strands  3  and  4,  and  (3)  the  5'  a-helix 
segment.   Analysis  of  the  secondary  structure  of  class  II 
molecules  by  physical  criteria,  such  as  Fourier  transform 
infrared  and  circular  dichroism  spectroscopy,  are  consistent 
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with  the  class  II  model  proposed  by  Brown  et  al.  (1988) 
(Gorga  et  al.  1989).   Furthermore,  the  class  II  ABS  model  is 
consistent  with  a  variety  of  structural  and  functional 
studies  (Allen  et  al.  1987,  Buus  et  al.  1987). 

Generation  of  Mhc  Class  II  Gene  Polymorphism 

The  most  outstanding  feature  of  Mhc  genes  is  their 
unprecedented  genetic  polymorphism.   No  other  vertebrate 
genes  exhibit  such  a  high  degree  of  diversity  (Klein  1986) . 
Serological  studies,  tryptic  peptide  mapping,  and  molecular 
characterization  have  estimated  that  greater  than  100 
alleles  of  some  Mhc  loci  exist  in  natural  populations  of  Mus 
(Wakeland  and  Klein  1979;  Duncan  et  al.  1979a;  Gotze  et  al. 
1980;  Klein  and  Figueroa  1981;  1986).   Many  of  the  alleles 
are  globally-distributed  with  frequencies  ranging  from  1-10% 
in  wild  mouse  populations  (Gotze  et  al.  1980;  Nadeau  et  al. 
1981) .   In  addition,  greater  than  90%  of  wild  mice  are 
heterozygous  at  H-2  (Duncan  et  al.  1979b) ,  an  observation 
fully  consistent  with  the  high  degree  of  diversity  of  Mhc 
genes. 

Restriction  fragment  length  polymorphism  (RFLP) 
analysis  with  single  copy  probes  spanning  the  I-region 
reveals  variable  and  conserved  tracts  of  DNA  (Steinmetz  et 
al.  1984;  Tarnuzzer  1988).   The  centromeric  half  of  the  I- 
region,  that  encodes  the  Ab,  Aa,  and  5'  portion  of  the  Eb 
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gene,  shows  extensive  polymorphism  and  allelic  variability. 
On  the  other  hand,  the  telomeric  portion  of  the  I-region, 
encoding  the  31  portion  of  the  Eb  gene,  Eb2,  and  Ea, 
displays  little  polymorphism.   The  boundary  runs  through  the 
Eb  gene,  close  to  and  perhaps  overlapping  with  the 
recombinational  hotspot  in  the  intron  between  Ebl  and  Eb2 
exons. 

Nucleotide  sequence  comparisons  of  the  four  functional 
class  II  genes  derived  from  laboratory  strains  of  mice  is 
consistent  with  the  observations  made  at  the  RFLP  level;  Ab, 
Aa.  and  Eb  are  polymorphic,  whereas  Ea  is  not  (Benoist  et 
al.  1983a;  1983b;  Choi  et  al.  1983;  Malissen  et  al.  1983; 
Mengle-Gaw  and  McDevitt  1983;  Estess  et  al.  1986).   Allelic 
nucleotide  sequence  variation  can  be  extensive;  alleles  of 
Ab  or  Aa  commonly  differ  by  5-10%  of  their  nucleotide 
sequence  (Benoist  et  al.  1983b;  Estess  et  al  1986).   Thus, 
the  RFLP  and  nucleotide  sequence  data  suggest  that  the 
diversity  is  indeed  greater  in  both  the  coding  and  non- 
coding  regions  of  the  variable  tract  as  compared  to  the 
conserved  tract  of  the  I-region. 

Nucleotide  sequence  analysis  of  the  Ab,  Aa,  and  Eb 
genes  from  different  laboratory  strains  of  mice  all  indicate 
that  the  majority  of  the  diversity  is  localized  in  the  amino 
terminus  of  the  molecule,  specifically  the  al  and  bl  exons 
(Choi  et  al.  1983;   Benoist  et  al.  1983b;  Estess  et  al. 
1986) .   These  are  the  exons  that  encode  the  antigen  binding 
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site  (Brown  et  al.  1988) .   Closer  inspection  of  the 
nucleotide  sequence  variation  of  the  Ab,  ha,    and  Eb  genes 
reveals  that  most  of  the  substitutions  are  clustered  into 
regions  of  hypervariability  (Benoist  et  al-  1983b;  Mengle- 
Gaw  and  McDevitt  1983;  Estess  et  al-  1986).   This  diversity 
is  also  seen  at  the  amino  acid  level.   For  instance,  a  Kabat 
and  Wu  variability  plot  (Kabat  and  Wu  1970)  of  the  6  Aa 
alleles  sequenced  by  Benoist  et  al.  (1983b)  illustrates  that 
the  amino  acid  substitutions  fall  into  two  hypervariable 
regions  at  residues  11-15  and  residues  56-57. 

Hughes  and  Nei  have  examined  the  patterns  of  nucleotide 
substitutions  at  both  the  class  I  loci  (1988a),  and  the 
class  II  loci  (1989)  of  both  humans  and  mouse.   The  rates  of 
nonsynonymous  (replacement)  substitutions  versus  synonymous 
(silent)  substitutions  was  measured  for  the  various  domains 
of  the  Mhc  molecules.   In  both  class  I  and  class  II  loci, 
the  membrane  distal  domains  encoding  the  antigen  binding 
site  had  a  much  higher  rate  of  nonsynonymous  versus 
synonymous  substitutions.   This  was  contrasted  by  the  other 
parts  of  the  molecules  where  the  reverse  was  observed;  the 
rate  of  synonymous  substitutions  exceeded  that  of 
nonsynonymous  substitutions. 

These  observations  illustrate  the  extensive 
polymorphism  of  Mhc  class  II  genes,  and  imply  that  this 
genetic  diversity  reflects  a  unique  and  important  biological 
role  for  these  molecules.   The  functional  significance  of 
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this  polymorphism  is  still  unclear;  although,  it  is  thought 
to  directly  relate  to  disease  susceptibility.   In  addition, 
the  evolutionary  origin  of  Mhc  allelic  diversity  is  unknown; 
however,  two  hypotheses  dominate  speculations:  retention  of 
ancestral  polymorphisms  (Klein  1980;  1987),  and 
hypermutational  diversification  (Pease  1985) . 

Retention  of  Ancestral  Polymorphisms 

Klein  (1980)  first  postulated  that  the  extensive 
diversification  of  Mhc  alleles  could  be  explained  by  trans- 
species  evolution;  the  hypothesis  that  most  Mhc  alleles 
diverged  prior  to  the  origin  of  the  species  in  which  they 
are  presently  found.   The  divergence  of  contemporary  Mhc 
alleles,  therefore,  reflects  the  steady  accumulation  of 
mutations  over  long  evolutionary  timespans,  rather  than 
hypermutational  diversification  subseguent  to  speciation. 
The  hypothesis  of  retention  of  ancestral  polymorphisms 
postulates  that  allelic  lineages  of  Mhc  genes  are  maintained 
for  extremely  long  evolutionary  periods  in  natural 
populations,  independent  of  speciation  events  (Figure  2-6) . 
This  predicts  that  selection  may  act  to  maintain  specific 
sets  of  alleles  with  specific  antigen  binding  sites,  and 
conseguently  binding  properties.   The  most  common  allelic 
lineages  would  encode  antigen  binding  sites  which  are 
optimal  for  the  presentation  of  the  antigenicity  expressed 
by  the  prevalent  endemic  pathogens. 
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The  first  experimental  support  for  this  hypothesis 
demonstrated  that  Mhc  class  I  molecules  derived  from 
different  sub-species  of  the  Mus  musculus  complex  had 
identical  serological  reactivities  and  tryptic  peptide  maps 
(Arden  and  Klein  1982).    Direct  evidence  further 
illustrating  the  antiguity  of  Mhc  genes  was  reported  by 
McConnell  et  al.  (1988).   This  study  analyzed  Ab  gene 
polymorphism  by  RFLP  analysis,  and  revealed  that  greater 
than  90%  of  the  31  alleles  examined  could  be  organized  into 
two  evolutionary  lineages  based  on  the  presence  or  absence 
of  an  861  basepair  retroposon  insertion  into  the  intron 
between  the  Abl  and  Ab2  exons.   The  flanking  direct  repeats 
of  host  derived  seguences  on  either  side  of  the  retroposon 
indicate  that  the  insertion  into  this  position  was  a  random 
event  during  the  evolutionary  divergence  of  Ab.   Ab  alleles 
with  and  without  the  retroposon  insertion  were  found  in  4 
species  and  sub-species  of  the  genus  Mus.  demonstrating  that 
this  polymorphism  arose  in  the  ancestors  of  modern  Mus 
species,  and  was  maintained  as  a  polymorphism  across 
multiple  speciation  events.   These  findings  have 
subsequently  been  extended  to  115  independently  derived  Ab 
alleles,  representing  9  different  species  or  sub-species  of 
the  genus  Mus.   Ab  alleles  from  both  lineages  are  present  in 
Mus  carol i.  demonstrating  that  alleles  in  these  two  lineages 
diverged  at  least  8  million  years  ago  (Figure  2-7) (Lu  et  al. 
1990) . 
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Nucleotide  sequence  analysis  of  Ab  alleles  has  revealed 
that  some  of  these  genes  have  a  deletion  of  two  codons, 
while  others  do  not  (Choi  et  al.  1983;  Estess  et  al.  1986). 
The  deletions  occur  in  exon  2  at  amino  acid  positions  65  and 
67.   Figueroa  et  al.  (1988)  report  2  Ab  specific  monoclonal 
antibodies  that  correlate  perfectly  with  the  two  types  of  Ab 
genes.   The  H-2A.m27  antibody  reacts  with  the  Ab  chains  that 
have  the  two  deletions,  and  the  H-2A.m25  antibody  reacts 
with  Ab  chains  that  are  undeleted.   Strains  that  are 
homozygous  for  Ab  show  a  perfectly  antithetical  relationship 
between  the  determinants  that  these  antibodies  detect,  they 
are  either  m25-positive  and  m27-negative  or  vica  versa.   No 
molecule  has  been  found  that  reacts  with  both  antibodies, 
and  only  one  molecule  reacts  with  neither  of  the  two 
antibodies.   Utilizing  these  two  antibodies  and  Northern 
blot  hybridization  with  allele-specif ic  oligonucleotides, 
Figueroa  et  al.  (1988)  where  able  to  demonstrate  the 
presence  or  absence  of  the  amino  acid  position  65/67 
deletion  polymorphism  in  10  species  and  sub-species  of  the 
genus  Mus.  in  addition  to  Rattus  norveqicus.   This  data 
indicates  that  the  65/67  deletion  polymorphism  already 
existed  in  the  last  common  ancestor  of  mice  and  rats. 

A  number  of  different  mutations  affecting  both  the  a 
and  b  chains  of  the  E  molecule  can  result  in  E  molecule  non- 
expression  (Jones  et  al.  1990)  .   (These  mutations  will  be 
discussed  at  greater  depth  in  the  following  sections  of  this 
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literature  review.)   Many  of  these  mutations  can  be 
identified  in  mice  from  multiple  species  and  sub-species  of 
the  genus  Mus,  indicating  that  the  mutations  were  already 
present  in  nascent  species  and  survived  multiple  speciation 
events . 

The  fact  that  many  of  these  polymorphisms  are  found  in 
multiple  species  requires  that  the  different  alleles  be 
present  at  relatively  high  frequencies  in  the  species- 
founding  populations.   These  founding  populations  must  also 
have  been  of  reasonably  large  sizes.   If  either  of  these  two 
criterion  had  not  been  met,  there  would  be  a  high  likelihood 
of  losing  the  polymorphism  by  random  drift,  particularly  the 
retroposon  polymorphism  in  the  intron  of  Ab,  where  selection 
would  presumably  act  at  a  minimum  (Nei  1987) . 

These  findings,  together  with  similar  results  for 
primates  (Lawler  et  al-  1988;  Parham  et  al.  1989;  Gyllensten 
and  Erlich  1989;  Mayer  et  al.  1988),  demonstrate  that  the 
retention  of  ancestral  polymorphisms  over  extremely  long 
evolutionary  periods  can  account  for  the  extensive  diversity 
seen  in  modern  Mhc  alleles  in  natural  populations  of 
rodents . 

Hvpermutational  Diversification  of  Class  II  Genes 

The  presence  of  hypervariable  regions  of  DNA  within 
class  II  genes  suggests  that  hypermutational  mechanisms  such 
as  gene  conversion  or  segmental  exchange  may  be  operating  to 
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rapidly  diversify  the  regions  of  Mhc  genes  responsible  for 
immune  responsiveness.   This  hypothesis  predicts  that  most 
of  the  polymorphism  will  be  generated  within  the  lifetime  of 
a  species,  potentially  allowing  alleles  to  rapidly  adapt  to 
changes  in  the  antigenicity  of  endemic  pathogens  (Pease 

1985) . 

Gene  conversion  or  segmental  exchange  was  originally 
defined  in  fungi  (Radding  et  al.  1978) ,  and  is  a  mechanism 
by  which  DNA  sequence  is  copied  or  transferred  from  one  gene 
to  another.   Although  the  DNA  sequences  can  be  transferred 
to  and  from  genes  anywhere  in  the  genome,  it  is  more  common 
to  occur  within  multigenic  or  multiallelic  families 
(Baltimore  1981;  Robertson  1982;  Slightom  et  al.  1980). 
Gene  conversion  is  defined  by  the  DNA  transfer  between 
discrete  loci,  whereas  intragenic  segmental  exchange  occurs 
between  alleles  of  a  particular  locus.   Pairing  between 
partially  homologous  sequences  during  meiosis  or  mitosis  is 
followed  by  mismatch  repair  thereby  converting  part  of  one 
sequence  to  that  of  another.   The  primary  evidence  for  gene 
conversion  events  is  the  clustering  of  nucleotide 
substitutions.   This  pattern  of  diversity  is  clearly 
documented  in  Mhc  class  I  genes  (Mellor  et  al.  1983;  Weiss 
et  al.  1983;  Nathenson  et  al.  1986;  Geliebter  and  Nathenson 

1987)  . 

Direct  evidence  for  gene  conversion  in  Mhc  class  II 
genes  has  been  reported  by  Mengle-Gaw  et  al-  (1984),  where 
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an  alloreactive  T  cell  clone,  4.1.4,  recognized  a 
determinate  present  on  both  the  Eb  and  Ab   ^molecules . 
Nucleotide  sequence  comparisons  between  Ab  .  Ab   .  and  Eb 
(Choi  et  al.  1983;  Mclntyre  and  Seidman  1984)  revealed  that 
the  Ab   sequence  is  identical  to  Eb  in  the  region  where  it 
differs  from  Ab  .   The  region  exchanged  must  have 
encompassed  a  minimum  of  14  nucleotides,  because  the  3 
nucleotide  changes  occurred  in  this  14  base  pair  stretch. 
The  area  of  exchange  is  flanked  by  regions  of  exact  homology 
extending  for  20  nucleotides  51  and  9  nucleotides  3'. 

McConnell  et  al.  (1988)  demonstrated  evidence  for 
segmental  exchange  occurring  in  Mhc  class  II  genes.   By 
examining  the  nucleotide  sequence  of  eight  alleles  of  Ab. 
the  sequence  of  the  Ab2  exon  of  all  8  alleles  corresponded 
to  the  appropriate  genomic  evolutionary  lineage,  as  defined 
by  the  retroposon  insertion.   However,  the  nucleotide 
sequence  of  the  Abl  exon  of  two  of  the  8  alleles,  Ab.  and 
Ab  .  did  not  reflect  their  evolutionary  lineaqe,  and 
therefore  reflects  the  exchange  of  sequence,  by  segmental 
exchange,  from  alleles  of  a  different  evolutionary  lineage 
(Figure  2-8) . 

At  present,  the  relative  importance  of  recombinational 
mechanisms  versus  the  accumulation  of  point  mutations  over 
long  evolutionary  periods  in  the  generation  of  Mhc  class  II 
gene  diversity  has  yet  to  be  determined.   This  is  one  the 
goals  of  this  dissertation. 


T3 

0) 

.2   <0   Q) 

o 
c 

O         d) 

°-    IC 
03         fj 

0) 

(0        <M 

01    m 

O 

Q) 

j)  oj  m 

X   C  X 

^  H      , 

+J   0-H 

in  w 

<H    0 

■PH+i 

«J  <o  S 

o  a 

o    • 

X!         2 

w  u  e  -p  • )  e. 

ft-P   Q) 

03  -P   ^ 

•H    <D  X 

x  n  -p 

g         W 

to 

03  tn  .. 

cue 

■H   C   °. 

OX!  <1)  X  -H  T 

•h  +j  a) 

-p      £ 

<C  X  -P 

£  e  £  <* 

H   U  £  *-' 
0         g 

H4J  m 

Q)-H  XI 

U    S 

C 

QUO 

Cu  a) -H   . 

X  C  M 

a)  eh  ^  (ol 

-P   0  +J 

X  C 

cn  a)  -h 

1  .-g-a 

<D-H    3  rH 

03  X  £   0J 

ft  P   C 

O   O   ft  0 

c 

•h  (n  a) 

N  Xj  X 
•H  rtj  -P 

M 

(0  -0  C 

i  c-h 

g   (0 

3         ^ 
03  H   3 

X   1   -  O 

e^  o 

a)  o  ^ 
>  ft-0  g 

n)      o 

m  a) 

(0        ...   0 

tPX  XI  X!   :: 

(0  -P   0 

03    03    2 

•H        -H 

-OCX 

O   O^TJ 

-H   £  H   ft  „    0) 

< 

a)  o  H  -P 

MlllH^ft 

g     E   rH   -P     ..    (0 

•  u  to 

(0   0)   a  TJ 

00  -H  -H 

^  5  rt: 

1   X!  XI  H 

-P   *   03 

a)  o  o 

3  && 

-P              -P 

X  X,v  c 

0">rH  iH 

Oi -Pi 

a> 

•H   0   0 

•H-H  X! 

> 

fe  a  a  <u  >  < 

a) 

42 


C\] 


c 
o 

QCL   X 
LJJ 


c 
o 


CO 

c 
g 

"tr 

CD 
CO 

c 


_  c 
o 

LU 


CD 


O  03  CD 

C  C  O) 

CD  CD 

g)  c 

2  &  o 

■^  CD  X 

.b  CO  CD 


c 
cs 


cr 


CO 


O 

c 


CD 

O) 

03 

CD 

C 


CM 


CO 


43 


Functional  Role  of  Mhc  Polymorphism 

Regulatory  T  lymphocytes  are  responsible  for  initiating 
and  coordinating  antigen  specific  immune  responses. 
Activation  of  virgin  T  regulatory  cells,  or  regulatory  T 
cells  that  have  not  come  into  contact  with  their  specific 
ligand,  is  dependant  upon  a  set  of  signals  delivered  by  the 
antigen  presenting  cell.   This  stringency  is  designed  to 
maintain  the  specificity  of  the  resultant  immune  response, 
and  ensure  the  inactivity  of  autoreactive  T  cells.   First, 
regulatory  T  lymphocytes  must  recognize  processed  peptide 
antigen  bound  to  molecules  encoded  by  the  I-region  of  the 
major  histocompatibility  complex.   This  recognition  is 
achieved  via  T  cell  surface  structures  including  the 
clonally  distributed  T  cell  antigen  receptor,  and  the  co- 
receptor  molecule  CD4.   Second,  the  antigen  presenting  cell 
must  provide  a  costimulatory  signal,  such  as  the  membrane 
form  of  interleukin-1.   The  regulatory  T  cell  must  receive 
both  of  these  signals  in  tandem.   Either  signal  alone  is  not 
sufficient  to  induce  T  lymphocyte  activation,  or  the 
subseguent  immune  response  to  the  antigen  from  which  the 
peptide  was  derived. 

Class  II  molecules  play  a  crucial  role  in  this  antigen 
presenting  cell-T  cell  interaction.   Their  function  is  to 
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act  as  promiscuous  receptors  for  antigen  fragments;  thereby 
making  them  recognizable  to  T  lymphocytes.   These  two 
events,  antigen  binding  by  class  II  molecules,  and 
regulatory  T  lymphocyte  recognition  of  the  resultant 
bimolecular  complex,  not  only  forms  the  basis  for  antigen 
specific  immune  responsiveness,  but,  in  addition,  determines 
to  a  large  extent  the  intensity  of  the  ensuing  immune 
response. 

Regulation  of  Expression  of  Class  II  Molecules 

Consistent  with  their  function  to  bind  and  present 
antigen  to  regulatory  T  lymphocytes,  the  expression  of  class 
II  molecules  is  restricted  to  the  antigen  presenting  cells 
of  the  body.   These  antigen  presenting  cell  types  include 
macrophage,  dendritic  cells,  B  lymphocytes,  and  thymic 
epithelial  cells.   Macrophage  are  the  primary  antigen 
presenting  cell  in  the  body  (Unanue  and  Allen  1987) ,  and  as 
such  have  the  unigue  ability  to  trigger  virgin  T  cells 
(Lassila  et  al.  1988) .   However,  resting  macrophage  do  not 
constitutively  express  class  II  molecules  on  their  surface, 
rather  cell  surface  expression  is  under  both  positive  and 
negative  control  (Steinman  et  al.  1980;  Snyder  et  al.  1982). 
Supernatants  of  mitogen  activated  T  cells  induce  the  cell 
surface  expression  of  class  II  molecules  on  macrophage 
(McNichols  1982) .   Biochemical  analysis  of  the  inducing 
component  of  these  supernatants  have  determined  the  factor 
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to  be  gamma-interferon  (Steeg  et  al.  1982;  King  and  Jones 
1983) .   Gamma-interferon  increases  the  cell  surface 
expression  of  both  the  A  and  E  molecules,  as  well  as  class  I 
molecules.   This  control  appears  to  act  at  the  level  of 
transcription,  such  that  there  is  a  coordinate  increase  in 
the  level  of  mRNA  of  all  four  class  II  chains  within  8  hours 
of  treatment  with  gamma-interferon  (Paulnock-King  et  al. 
1985) .   Prostoglandins,  glucocorticoids,  and  the  bacterial 
endotoxin  LPS  have  all  been  shown  to  have  a  negative  effect 
on  the  cell  surface  expression  of  class  II  molecules  (Snyder 
et  al.  1982;  Aberer  et  al.  1984;  Steeg  et  al.  1982). 
Precursor  B  lymphocytes  do  not  express  class  II 
molecules;  however,  mature  B  cells  and  plasma  cells  show 
heterologous  constitutive  levels  of  class  II  on  their 
surface  (Mond  et  al.  1981;  Monroe  and  Cambier  1983).   The 
levels  of  class  II  expression  on  resting  B  cells  can  be 
augmented  by  incubation  with  mitogen  activated  T  cell 
supernatants  (Roehm  et  al.  1984),  and  subseguent  studies 
have  shown  the  factor  responsible  for  this  to  be 
interleukin-4  (BSF-1) (Noelle  et  al.  1984) .   Interleukin-4 
can  induce  the  levels  of  class  II  mRNA  within  one  hour  and 
cell  surface  expression  levels  as  early  as  two  hours  after 
incubation  of  B  cells  (Polla  et  al.  1986) .   B  lymphocytes  do 
not  have  the  ability  to  activate  virgin  T  cells  (Lassila  et 
al.  1989) .   However,  they  may  play  an  integral  role  in 
antigen  presentation  during  a  secondary  T  cell  response 
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because  of  their  ability  to  pick  up  and  display  minute 
quantities  of  antigen  (Lanzavecchia  1985) . 

These  induction  mechanisms  for  class  II  molecule 
expression  illustrates  the  importance  of  class  II  molecule 
cell  surface  expression  to  the  interaction  of  regulatory  T 
lymphocytes  and  antigen  presenting  cells  resulting  in  an 
immune  response.   Furthermore,  this  expression  of  class  II 
molecules  on  limited  cell  types  ensures  regulatory  T  cell 
reactivity  can  take  place  only  while  interacting  with 
selected  cells  of  the  body.   This  introduces  a  control 
mechanism  to  ensure  the  inactivity  of  autoreactive  T  cells. 

The  Functional  Expression  of  Class  II  Molecules 

Initial  serological  and  biochemical  characterization  of 
Mhc  class  II  molecules  revealed  a  heterodimeric  glycoprotein 
requiring  the  association  of  both  the  a  and  b  chains  (Jones 
1977;  Jones  et  al.  1978).   Serological   analysis  of  class  II 
molecules  expressed  in  inbred  and  wild  mice  has  shown  the  A 
molecule  expressed  in  all  populations  of  mice  examined. 
However,  four  of  eleven  inbred  strains  and  5-30%  of  wild 
haplotype  mice  fail  to  express  an  E  molecule  on  the  cell 
surface  (Jones  et  al.  1981;  Nizetic  et  al.  1984) .   Analysis 
of  the  Ea  and  Eb  polypeptides  of  the  four  inbred  E~ strains 
by  2-dimensional  gel  electrophoresis  revealed  that  the  H-2 
and  the  H-2smice  synthesize  normal  Eb  chains  but  do  not 
express  Ea  chains,  whereas  the  H-2  and  H-2^mice  do  not 
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synthesize  either  Ea  or  Eb  chains  (Jones  et  al.  1978;  Jones 
et  al.  1981) .   Recently  the  molecular  defects  resulting  in  E 
molecule  non-expression  have  been  identified,  and  thus  far, 
seven  independent  defects  have  been  detected  (Jones  et  al. 

1990) . 

b         s 
The  Ea  gene  of  the  H-2  and  H-2  haplotvpes  have  a  627 

base  pair  deletion  encompassing  the  promoter  and  first  exon 

(Mathis  et  al.  1983).   The  Ea5  gene  has  a  single  nucleotide 

insertion  in  codon  64,  causing  a  frameshift  leading  to  a 

f 
stop  codon  at  position  69  (Vu  et  al.  1989)  .   The  Ea.  gene 

also  has  a  single  nucleotide  insertion,  but  at  codon  -2, 

thereby  generating  a  downstream  stop  codon  (Vu  et  al.  1989) . 

The  Eb  mutation  of  the  H-2v3lnd  H-2w2%aplotypes  is  a 

single  nucleotide  substitution  at  codon  7  generating  a  stop 

codon.   There  are  also  two  independent  mutations  occurring 

in  the  RNA  donor  splice  site  at  the  first  exon-intron  border 

of  the  Eb  gene  (Tacchini-Cottier  and  Jones  1988;  Vu  et  al. 

1988) .   Both  of  these  mutations  lead  to  aberrant  RNA 

processing.   Jones  et  al.  (1990)  also  report  another  Eb 

mutation  distinct  from  the  first  three,  but  have  not  as  yet 

molecularly  characterized  it.   All  the  defects  described 

causing  E  molecule  non-expression,  with  the  exception  of  the 

f 
insertion  affecting  the  Ea  gene  of  H-2  .  have  also  been 

found  in  various  wild  mouse  haplotypes  (Jones  et  al.  1990; 

Dembic  et  al.  1984) .   The  large  number  of  mice  not 
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expressing  an  E  molecule  may  indicate  that  the  two  Mhc  class 
II  molecules  are  not  functionally  equivalent. 

Chain  Association  of  Class  II  Molecules 

The  extensive  polymorphism  of  Mhc  class  II  molecules 
together  with  the  critical  nature  of  their  function  of 
binding  antigen  allowing  T  lymphocyte  recognition  suggests 
that  individuals  expressing  a  greater  variance  of  class  II 
molecules  on  the  cell  surface  of  an  antigen  presenting  cell 
would  be  at  a  selective  advantage.   Fathman  and  Kimoto 
(1981)  observed  that  the  a  and  b  chains  of  a  given  isotype 
can  transassociate  in  heterozygotes.   These  findings  gave 
rise  to  the  notion  of  free  association  of  allelic  varients 
within  an  isotype,  suggesting  that  4  types  of  class  II 
heterodimers  will  form  in  a  heterozygote.   In  contrast, 
cross-isotype  pairing  of  A  and  E  molecule  polypeptide  chains 
does  not  occur  except  in  artificial  experimental  systems 
(Murphy  et  al.  1980) .   Preferential  isotypic  pairing  is  due 
to  a  strong  increased  affinity  for  the  association  of 
isotype  matched  pairs  of  polypeptides  (Sant  and  Germain 
1989)  . 

Numerous  observations  now  suggest  that  preferential 
pairing  of  certain  allelic  A  molecule  polypeptide  chains 
limits  the  amount  of  transassociated  A  molecules  that  can  be 
formed.   Tryptic  peptide  analysis  from  serologically  related 
groups  of  mice  (Wakeland  and  Klein  1983)  show  the  Aa  and  Ab 
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polypeptides  from  these  strains  differ  by  less  than  10%  of 
their  tryptic  peptides  (Wakeland  and  Darby  1983). 
Restriction  fragment  length  polymorphism  analysis  of  the  Aa 
and  Ab  chain  genes  from  this  same  allelic  family 
corroborates  this  observation  at  the  DNA  level  (Tarnuzzer 
1988;  McConnell  et  al.  1986).   These  observations  suggest 
that  the  Aa  and  Ab  genes  on  the  same  chromosome  accumulate 
mutations  in  a  coordinate  manner,  thereby  ensuring  their 
ability  to  functionally  associate. 

Gene  transfection  experiments  by  Germain  et  al.  (1985) 
clearly  illustrated  that  allelic  variation  can  dramatically 
affect  the  ability  of  A  molecule  subunits  to  assemble 
correctly,  and  be  expressed  on  the  cell  surface.   These 
studies  showed  that  haplotype  mismatched  chains  cannot 
associate  as  efficiently  as  haplotype  matched  chains,  and 
therefore  are  not  expressed  at  appreciable  levels.   Further 
analysis  indicated  that  polymorphisms  in  the  amino  terminal 
half  of  the  Abl  domain  consistently  controlled  a  and  b  chain 
interactions  (Braunstein  and  Germain  1987) .   Buerstedde  et 
al.  (1988) ,  using  site-directed  mutagenesis  and  DNA  mediated 
gene  transfer,  have  shown  that  amino  acid  positions  9,  12, 
13,  14,  and  17  of  the  Abl  exon  are  responsible  for  proper 
chain  association  and  cell  surface  expression  for  the  H-2 
and  H-2k  haplotypes  examined.   The  amino  acid  positions  12 
and  13  being  particularly  significant  for  proper 
association. 
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These  studies  suggest  that,  in  order  for  proper  subunit 
association  and  cell  surface  expression,  the  a  and  b  chains 
of  the  A  molecule  need  to  be  co-adapted,  and  therefore  be 
from  the  same  or  similar  haplotype  (Figure  2-9) . 

The  ability  of  polypeptide  chains  of  the  E  molecule  to 
associate  is  under  different  selective  pressures.   In  the 
case  of  the  E  molecule,  only  the  Eb  chain  is  highly 
diversified  while  Ea  exhibits  very  low  levels  of  diversity. 
Therefore,  the  diversification  of  Eb  is  only  constrained  by 
the  requirement  to  associate  with  an  essentially  monomorphic 
Ea  (Figure  2-9) .   This  discrepancy  in  selective  pressures  of 
the  various  class  II  genes  to  properly  associate  may  be  due 
in  part  to  the  presence  of  a  recombinational  hotspot  in  the 
second  intron  of  Eb.   Homologous  recombination  at  this 
position  would  not  allow  co-evolution  of  the  two  genes. 

The  Role  of  the  Invariant  Chain  in  Class  II  Molecule 
Expression 

Mhc  class  II  molecules  are  associated  intracellularly 

with  a  third  glycoprotein  called  the  invariant  chain  (Ii) , 

which  displays  little  allelic  variation  among  the  different 

strains  of  mice  examined  (Jones  et  al.  1979) .   The  invariant 

chain  is  a  basic  polypeptide  of  31,000  daltons  that  is 

coprecipitated  with  class  II  molecules  in 

immunoprecipi tat ions  using  anti-la  antisera  of  monoclonal 

antibodies.   It  noncovalently  associates  with  class  II 
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molecules  in  the  membranes  of  the  endoplasmic  reticulum,  but 
has  not  been  detected  on  the  cell  surface  in  association 
with  class  II  molecules  (Sung  and  Jones  1981) .   It  has  been 
demonstrated  that  the  invariant  chain  is  coordinately 
regulated  with  class  II  molecules  (Koch  et  al.  1984; 
Paulnock-King  et  al.  1985) .   Although  the  function  of  the 
invariant  chain  is  unclear,  it  has  been  suggested  that  it 
plays  a  role  in  the  assembly  and  intracellular  transport  of 
class  II  molecules  to  the  cell  surface  (Sung  and  Jones  1981; 
Jones  et  al.  1979)  . 

The  Presentation  of  Antigen  by  Class  II  Molecules 

Regulatory  T  lymphocytes  recognize  the  bimolecular 
ligand  of  foreign  antigen  and  a  self  class  II  molecule  on 
the  surface  of  antigen  presenting  cells.   However,  unlike  B 
lymphocytes  which  directly  interact  with  antigen,  most  T 
lymphocytes  only  recognize  a  non-native  form  of  the  antigen 
(Schwartz  1985) .   The  conversion  of  an  antigen  from  a  native 
to  a  non-native  form  has  been  termed  antigen  processing,  and 
is  performed  by  antigen  presenting  cells  which  express  class 
II  molecules  on  their  surface.   Although  much  is  still 
unknown  about  the  intricacies  of  antigen  processing,  the 
following  is  a  summary  of  events  (Werdelin  et  al.  1989; 
Germain  1988) . 

The  first  step  involved  in  antigen  processing  is 
ingestion  of  the  antigen.   Macrophage  accomplish  this  by 
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constitutive  endocytosis,  whereas  B  lymphocytes,  by  virtue 
of  their  immunoglobulin  receptor,  utilize  receptor-mediated 
endocytosis.   The  ingested  antigen  is  transported  into  the 
interior  of  the  cell  in  an  endocytic  vacuole. 

The  second  step  of  the  process  takes  place  when  the 
endocytic  vacuole  becomes  acidified  and  proteolytic  enzymes 
with  an  acid  optimum  become  activated.   This  results  in  the 
partial  degradation  of  the  antigen;  hence,  the  antigen  is 
broken  down  into  peptide  fragments. 

The  third  step  in  antigen  processing  is  the  binding  of 
antigenic  fragments  to  Mhc  class  II  molecules.   This 
presumably  occurs  in  an  intracellular  compartment,  but 
exactly  where  in  the  cell  this  occurs  is  not  known.   Once  an 
antigenic  fragment  is  bound  to  a  class  II  molecule,  it  is 
protected  from  complete  proteolytic  destruction.   However, 
parts  of  the  antigen  fragments  which  .are  outside  the  antigen 
binding  site,  may  not  be  protected  against  further 
degradation. 

The  fourth  step  consists  of  transporting  the  class  II 
molecule-processed  antigen  fragment  complex  back  to  the 
surface  of  the  antigen  presenting  cell.   Once  there,  the 
class  II  molecule  acts  to  keep  the  antigen  fragment  in  a 
constant  orientation  with  a  stable  conformation,  thereby 
allowing  recognition  by  a  T  lymphocyte. 

The  processing  requirements  may  vary  with  each 
particular  antigen,  depending  on  the  conditions  required  to 
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induce  the  conformational  flexibility  needed  for  the  antigen 
to  bind  a  class  II  molecule  (Allen  1987) .   For  instance, 
some  proteins  may  require  no  processing,  because  at  least  a 
portion  of  the  protein  has  enough  freedom  in  its  native 
state  to  become  stably  bound  to  a  class  II  molecule.   Other 
proteins  may  simply  need  denaturation,  such  as  a  reduction 
and  alkylation  of  disulfide  bonds,  to  reveal  peptide 
fragments  able  to  bind  to  class  II  molecules.   The  most 
stringent  antigen  processing  would  require  proteolytic 
cleavage  of  the  native  protein.   Irrespective  of  the  type  of 
antigen  processing  necessary,  the  immunogenic  peptide  must 
possess  two  distinct  features.   First,  it  must  be  able  to 
bind  to  a  class  II  molecule,  and  the  class  II  molecule 
contact  sites  of  an  immunogenic  peptide  is  called  an 
agretope  (Haber-Katz  et  al.  1983) .   The  immunogenic  peptide 
must  also  make  contact  with  the  T  cell  antigen  receptor,  and 
this  site  on  the  peptide  is  termed  ah  epitope. 

The  first  direct  evidence  for  peptide-class  II  molecule 
association  came  from  Babbitt  et  al.  (1985)  using  an 
equilibrium  dialysis  method  employing  purified  class  II 
molecules  and  peptide  fragments.   These  experiments  showed  a 

peptide  from  hen  egg  lysozyme,  HEL  46-61,  previously 

k  d 

demonstrated  to  be  immunogenic  for  H-2  and  not  for  H-2  . 

k  d 

bound  specifically  to  A_  molecules  but  not  to  A^  molecules  in 

a  saturable  process,  with  an  affinity  in  the  micromolar 

range.   This  direct  correlation  between  antigen-class  II 
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molecule  interaction  and  Mhc  restriction  was  subsequently 
extended  for  numerous  other  antigens  (Buus  et  al.  1986a; 
1986b;  1987;  Guillet  et  al.  1987).   Furthermore,  inhibition 
analysis  illustrated  that  peptides  restricted  to  a 
particular  class  II  molecule  competitively  inhibited  one 
another  from  binding.   This  suggested  that  a  class  II 
molecule  contained  just  a  single  antigen  binding  site  (Buss 
et  al.  1987;  Babbitt  et  al.  1986;  Guillet  et  al.  1987),  an 
observation  in  agreement  with  X-ray  crystallographic 
analysis  (Bjorkman  et  al.  1987a) . 

Utilizing  a  gel  filtration  system  enabling  complexes  of 
antigenic  peptides  and  class  II  molecules  to  be  separated 
from  unbound  peptide,  Buus  et  al.  (1986b)  were  able  to  study 
the  kinetics  of  association  and  disassociation  of  these 


complexes.   These  experiments,  using  the  ovalbumin  323-339 
peptide/A_  system,  illustrated  that  the  rate  of  complex 
formation  is  very  slow  (Ka  1M  s   but  once  formed  the  class 


II  molecule  peptide  complex  is  remarkably  stable  (Kd  3xl0~  s~ 

)  .  This  suggests  that  the  association  of  class  II 
molecules  with  antigen  fragments  most  likely  occurs  in  an 
intracellular  vesicular  compartment  as  opposed  to  the  plasma 
membrane,  since  this  would  prevent  soluble  processed  antigen 
from  diffusing  away  from  the  membrane  bound  class  II 
molecule.   This  intracellular  compartment  would  probably 
have  a  neutral  pH,  based  on  a  10-fold  slower  rate  of  complex 
formation  at  pH  4.6,  compared  with  pH  7.2,  and  the  lability 
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of  preformed  complexes  to  acid  pH.   This  peptide-class  II 
molecule  complex  sensitivity  to  acid  pH  may  represent  a 
mechanism  by  which  class  II  molecules  could  rid  itself  of 
complexed  peptide,  and  be  available  to  bind  newly  processed 
antigen.   Recycling  of  class  II  molecules  (Pernis  1985) 
together  with  biosynthesis  (Harding  et  al.  1989)  could 
effectively  prevent  potentially  constant  saturation  of  class 
II  molecule  binding  sites  with  peptides  derived  from  self- 
proteins. 

Babbitt  et  al.  (1986)  first  observed  that  class  II 
molecules  can  bind  antigenic  fragments  of  self-proteins. 
Lorenz  and  Allen  (1988;  1989)  further  characterized  the 
ability  of  class  II  molecules  to  bind  self -peptides.   These 
studies  provided  direct  functional  proof  in  vivo  that  self 
proteins  are  processed  const itutively,  and  can  be  presented 
in  a  fashion  similar  to  that  by  which  foreign  antigens  are 
presented.   In  addition,  experiments  by  Adorini  et  al. 
(1988;  1990)  demonstrate  that  peptides  of  foreign  antigens 
generated  by  processing  events  must  compete  for  binding  to 
class  II  molecules  with  peptides  generated  from  self- 
antigens  in  vivo.   Thus,  self-tolerance  does  not  occur  at 
the  level  of  the  antigen  presenting  cell,  because  antigen 
presentation  does  not  discriminate  self  from  non-self. 
Rather,  it  occurs  at  the  level  of  the  regulatory  T 
lymphocyte,  either  through  functional  or  physical  deletion 
of  self-reactive  T  cells. 
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Any  given  class  II  molecule  can  bind  a  wide  variety  of 
peptides,  however  different  class  II  molecules  show  distinct 
broad  specificity  patterns.   This  is  reflected  in  the 
variation  between  alleles  in  their  capacity  to  present 
different  peptides  to  the  immune  system.   For  instance,  when 
overlapping  peptides  comprising  an  entire  protein  have  been 
analyzed  for  reactivity,  different  Mhc  class  II  molecules 
have  been  found  to  present  different  peptide  determinants  to 
T  lymphocytes  (Roy  et  al.  1989;  Allen  et  al.  1987). 
Furthermore,  class  II  molecules  only  bind  a  subset  of 
peptides  derived  from  native  protein  (Braciale  et  al.  1989) , 
and  there  is  a  definite  hierarchy  of  peptide  determinants 
that  are  immunodominant  for  particular  allelic  forms  of  Mhc 
gene  products  (Ria  et  al.  1990;  Roy  et  al.  1989;  Berzofsky 
et  al.  1989) .   Which  immunodominant  region  of  the  native 
protein  is  ultimately  recognized  by  T  lymphocytes  is 
predominantly  influenced  by  the  particular  Mhc  class  II 
allele  expressed.   This  influence  reflects  the  ability  of 
processed  fragments  to  bind  to  a  particular  class  II 
molecule,  and  demonstrates  the  affect  Mhc  class  II  molecule 
polymorphisms  have  in  controlling  an  immune  response 
(Benacerraf  1978;  Babbitt  et  al.  1985;  Buus  et  al.  1987). 
In  conclusion,  these  studies  of  Mhc  molecule-antigen 
interaction  illustrate  the  broad  specificity  of  the  class  II 
molecule  antigen  binding  site.   Although  the  interaction  is 
generally  permissive,  the  direct  correlation  of  peptide 
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binding  and  Mhc  restriction  powerfully  illustrates  the 
crucial  role  class  II  molecules  play  in  the  initiation  of  a 
T  cell  dependent  immune  response. 

Wild  Mice 

The  goals  of  this  dissertation  are  to  elucidate  the 
evolutionary  mechanisms  responsible  for  generating  Mhc  class 
II  gene  polymorphism,  and  examine  the  role  selection  plays 
in  driving  this  extensive  diversification.   Previous  studies 
addressing  these  questions  utilized  techniques,  such  as 
serology  and  tryptic  peptide  mapping  which  have  a  limited 
capacity  to  answer  these  questions,  as  compared  to  obtaining 
the  DNA  sequence  of  the  genes.   The  nucleotide  sequence  of  a 
limited  number  of  Mhc  class  II  genes  has  been  obtained  only 
from  a  few  standard  inbred  laboratory  strains  of  mice. 
Aside  from  having  uncertain  genetic  origins,  inbred  strains 
of  mice  were  derived  from  a  limited  number  of  sources  that 
were  generated  by  a  high  degree  of  inbreeding.   This 
represents  a  biased  sampling  of  the  mouse  population  and  an 
artificial  collection  of  considerable  homogeneity  (Ferris  et 
al.  1982;  Klein  1974). 

Wild  mice  are  unconfined  animals  whose  breeding  is  not 
controlled  by  man  (Bruell  1970) ,  and,  as  such,  represent  a 
collection  of  I-region  haplotypes  of  significant 
heterogeneity,  particularly  when  compared  to  standard 
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laboratory  inbred  strains  of  mice.   A  number  of  features 
make  the  study  of  the  evolutionary  dynamics  of  the  wild 
mouse  Mhc  particularly  attractive.   Natural  populations  of 
wild  mice  are  abundant  and  their  phylogenetic  relationships 
have  been  extensively  characterized.   Furthermore,  these 
mice  represent  the  product  of  evolutionary  processes  where 
the  I-region  haplotypes  are  fixed  and  maintained  through 
natural  selection. 

Natural  History  of  Wild  Mice 

Wild  mice  can  be  divided  into  3  categories  of  animals 
depending  on  their  association  with  man:  aboriginal, 
commensal,  and  feral  (Sage  1981).   Aboriginal  mice  are 
genuinely  wild,  with  essentially  no  interaction  with  man. 
With  the  exception  of  one  subspecies  that  is  indigenous  to 
northwest  Africa,  aboriginal  mice  are  found  only  on  the 
Eurasian  continent.   Typically,  they  are  dry-area  animals, 
and  feed  on  grass,  seeds,  and  grain. 

Commensal  mice,  on  the  other  hand,  live  in  close 
association  with  man,  and  rely  on  man  for  their  main  source 
of  food  and  shelter.   Marshall  (1981)  distinguishes  4 
commensal  subspecies  of  Mus  musculus;  M.  m.  domesticus.  M. 
m.  musculus.  M.  m.  castaneus.  and  M.  m.  molossinus. 
Commensal  mice,  like  aboriginal  mice  are  also  indigenous  to 
Eurasia,  and,  in  addition  have  radiated  to  habitats 
throughout  the  world.   They  have  successfully  adapted  to  the 
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extremely  varied  climatic  conditions  of  environments  ranging 
from  Europe,  the  Americas,  Australia,  Africa,  and  several 
south  Pacific  islands.   Ferris  et  al.  (1983)  estimate  that 
the  commensal  relationship  between  mouse  and  man  has  existed 
for  approximately  1  million  years.   This  is  based  on  fossil 
evidence,  nuclear  DNA  variation,  and  mitochondrial  DNA 
variation. 

Feral  mice  were  once  commensals  of  man,  but  reverted  to 
a  more  aboriginal  existence  (Bruell  1970) .   They  are  found 
in  areas  such  as  agricultural  fields,  open  grasslands, 
marshes,  sandhills,  and  coastal  islands,  and  feed  on  grass 
and  grain  (Sage  1981) .   Permanent  reversion  to  feral  habits 
primarily  occurs  only  in  dry  climatic  zones. 

Mus  musculus  domesticus  is  presently  found  throughout 
the  world.   However  it  originated  in  western  Europe  and 
subsequently  spread  to  the  Americas  and  Australia  in 
association  to  the  global  movements  of  Europeans  (Bonhomme 
1986a) .   Mus  musculus  musculus  is  endemic  to  northeastern 
Europe  and  central  Asia  (Sage  1981) ;  Mus  musculus  castaneous 
is  found  in  Malaya  (Harrison  1955) ,  India  (Srivastva  and 
Wattel  1973),  Indonesia  (Hadi  et  al.  1976),  and  Nepal  and 
Thailand  (Marshall  1977) .   The  native  range  of  Mus  musculus 
molossinus  is  eastern  Asia,  particularly  Japan  and  Korea 
(Hamijima  1962;  Jones  and  Johnson  1965).   Mus  spretus  is  a 
feral  species  endemic  to  the  western  rim  of  the  Mediterrean 
Sea  (Bonhomme  1986b) .   Mus  specilegus  are  the  aboriginal 
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mound-building  mice  found  in  the  steppe  of  eastern  Europe 
(Petrov  1979) .   Mus  spretoides  is  found  in  eastern  Europe, 
the  Balkan  penninsula,  Cypress,  and  Turkey  (Bonhomme  et  al. 
1984) .   Mus  cookii,  Mus  cervicolor.  and  Mus  carol i  are  all 
endemic  to  southeast  Asia  (Marshall  1986) .   The  natural 
range  of  Mus  olatvthrix  is  India  (Marshall  1986) . 

Phylogenetic  Relationships  in  the  Genus  Mus 

The  phylogenetic  relationships  of  the  various  species 
within  the  genus  Mus  have  been  extensively  studied  (Bonhomme 
1986a) ,  and  a  general  understanding  of  their  relationships 
can  be  inferred  (She  et  al.  1989) .   Different  species  are 
distinguished  from  subspecies  based  on  the  presence  of 
reproductive  barriers  in  natural  populations.   Therefore,  M_;_ 
m.  domesticus  and  M.  m.  musculus  can  interbreed  in  natural 
habitats,  and  in  regions  where  they  come  into  contact,  such 
as  central  Europe,  form  a  tightly  defined  hybrid  zone.   In 
contrast,  different  species  with  overlapping  ranges,  such  as 
M.  m.  domesticus  and  M.  soretus  do  not  interbreed  in  natural 
populations.   Although,  these  two  species  can  be  bred  in  an 
laboratory  environment,  the  resultant  male  hybrids  are 
commonly  sterile. 

The  three  major  molecular  techniques  employed  for 
biochemical  systematics  include  protein  electrophoresis, 
single  copy  nuclear  DNA  (sen  DNA)  hybridization,  and 
mitochondrial  (mt)  DNA  restriction  fragment  length 
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polymorphism  (RFLP)  analysis  (She  et  al.  1989).   Protein 
electrophoresis  assays  only  the  polymorphism  in  the  coding 
regions  of  the  genome,  and  therefore  is  likely  to  be 
constrained  by  natural  selection.   Sen  hybridization  studies 
reveal  differences  between  two  genomes  of  all  single  copy 
DNA,  including  exons,  introns,  and  flanking  sequences. 
Mitochondrial  DNA  RFLP  analysis,  on  the  other  hand,  assays 
the  cytoplasmic  genome  which  has  several  unique 
characteristics;  such  as  a  high  evolutionary  rate,  strictly 
maternal  inheritance  patterns,  and  an  absence  of 
recombination . 

Figure  2-10  illustrates  the  phylogenetic  relationship 
within  the  genus  Mus  and  Rattus  as  determined  by  DNA-DNA 
hybridization  studies  (She  et  al.  1989).   Similar 
phylogenetic  relationships  are  obtained  when  these  species 
are  compared  by  other  techniques;  however,  the  estimates  of 
the  exact  genetic  distance  among  the  Mus  species  vary 
depending  upon  the  technique  used.   In  comparing  9  species 
of  Mus,  5  subspecies  of  Mus  musculus.  and  species  from  the 
genus  Rattus.  there  are  seven  levels  of  divergence  among  the 
species,  ranging  from  0.3  million  years  (Mus  musculus 
complex)  to  10  million  years  (divergence  between  species 
within  the  genus  Mus  and  the  genus  Rattus)  (Luckett  and 
Hartenberger  1985;  She  et  al.  1989). 

By  analyzing  the  Mhc  class  II  gene  nucleotide  sequence 
from  wild-derived  alleles  from  a  number  of  different  species 
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and  subspecies  of  the  genus  Mus  and  Rattus,  it  should  be 
possible  to  obtain  an  evolutionary  perspective  of  the  forces 
acting  to  diversify  and  maintain  contemporary  Mhc  alleles. 
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CHAPTER  III 
MATERIALS  AND  METHODS 


Isolation  of  Genomic  DNA 


Genomic  DNA  was  isolated  from  liver  or  kidney  tissue  by 
a  Protease  K/SDS  method  as  detailed  in  Maniatis  et  al. 
(1982) .   The  extraction  was  performed  on  a  340A  Applied 
Biosystems  Inc.  (Foster  City,  CA)  nucleic  acids  extractor. 
Frozen  tissues  are  ground  in  a  mortar  containing  liquid  N2 
to  a  fine  powder,  and  added  to  3.5  ml  solution  of  lysis 
buffer  (Applied  Biosystems  Inc.,  Foster  City,  CA)  and 
Protease  K  (final  concentration  of  0.3  mg/ml)  (Applied 
Biosystems  Inc.,  Foster  City,  CA) .   The  solution  was 
incubated  overnight  at  65 °C.   The  remainder  of  the 
extraction  was  performed  by  the  machine.   Briefly,  the  DNA 
solution  was  extracted  two  times  with  a  Tris  equilibrated 
phenol  (pH  7.5)/chloroform/isoamyl  alcohol  solution  (25:1 
v/v) ,  and  one  time  with  just  the  chloroform/ isoamyl  alcohol 
solution.   The  genomic  DNA  was  then  ethanol  precipitated, 
washed  in  70%  ethanol,  and  resuspended  in  TE  buffer  (10  mM 
Tris  HCL,  pH  7.5;  1  mM  EDTA) ,  and  dialyzed  overnight  at  12 'C 
against  TE  buffer.   The  resulting  DNA  solutions  were 
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electrophoresed  on  a  0.7%  agarose  gel  for  quantification  and 
to  confirm  their  high  molecular  weight. 


Polymerase  Chain  Reaction  Amplification.  Cloning,  and 

Sequencing 


Amplification  of  Ab  exon  two  was  achieved  via  the 
polymerase  chain  reaction  (PCR)  described  by  Saiki  et  al. 
(1985)  with  slight  modification.   The  initial  100  fj.1 
reaction  mixture  contained  1  /xg  of  genomic  DNA,  50  mM  KC1, 
10  mM  tris  (pH  8.3),  1.5  mM  MgCl2,  0.01%  gelatin,  0.5%  DMSO 
(v/v) ,  and  250  mM  of  each  dNTP  (dATP,  dCTP,  dGTP,  and  dTTP) . 
80  pmoles  of  each  oligonucleotide  primer,  which  are 
complementary  to  stable  intron  sequences,  were  also 
included:  mouse  2:  CACGGCCCGCCGCGCTCCCGC;  mouse  3: 
CGGGCTGACCGCGTCCGTCCGCAG.   Samples  were  then  boiled  for  ten 
minutes,  quenched  on  ice,  and  5  U  Taq  DNA  polymerase 
(Perkin-Elmer  Cetus,  Norwalk,  CT)  was  added.   The  first  five 
amplification  rounds  consisted  of  1  minute  denaturing  at 
94 "C,  2  minutes  annealing  at  25 °C,  and  3  minutes  extension 
at  72  °C.   At  this  point,  200  iil  of  dH20,  5%  DMSO  (v/v)  was 
added  with  an  additional  5  U  Taq  DNA  polymerase.   The 
amplification  protocol  for  the  ensuing  23  cycles  consisted 
of  1  minute  94 "C,  2  minutes  62 "C,  and  3  minutes  72 °C. 
Immediately  following  the  last  cycle  was  a  7  minute  72 "C 
chase  to  ensure  full  extension  of  all  amplified  fragments. 
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The  samples  were  then  ethanol  precipitated  and 
electrophoresed  through  a  5%   nondenaturing  acrylamide  gel. 
The  fragment  of  interest  was  then  excised  and  eluted  into  3 
ml  of  elution  buffer;  0.5  M  ammonium  acetate,  0.1%  SDS,  and 
1  mM  EDTA,  at  50 °C  overnight.   The  mixture  was  then 
centrifuged  at  2,000  rpm  for  10  minutes,  the  supernatant  was 
ethanol  precipitated,  the  pellet  was  washed  with  70% 
ethanol,  dried  and  resuspended  in  10  /xl  of  dH20.  The 
amplified  fragment  was  then  ligated  with  Sma  1  digested 
M13mpl8  overnight  at  25 °C  under  conditions  described  by  the 
supplier  (Bethesda  Research  Laboratories,  Bethesda,  MD) . 
Insert-positive  plaques  were  sequenced  via  the  Sanger 
dideoxyribonucleoside  method  employing  the  Sequenase 
protocol  (United  States  Biochemical,  Cleveland,  OH).   To 
eliminate  potential  errors  introduced  by  the  PCR,  at  least  2 
clones  per  sequence  were  analyzed. 


Spleen  Cell  Isolation.  Immunostaining,  and  Flow  Cytometric 

Analysis 


Freshly  explanted  spleens  were  minced  through  wire 
screens  to  make  a  single  cell  suspension.   Red  blood  cells 
were  lysed  by  incubating  the  cells  in  a  IX  ammonium  chloride 
solution  for  5  minutes  at  25 °C.   The  remaining  spleen  cells 
were  washed  thoroughly  with  IX  PBS.   1  x  10  cells  were 
resuspended  in  400  pi    IX  PBS,  0.1%  NaN3  solution,  and  then 
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incubated  in  a  1:2  dilution  of  the  monoclonal  antibody 
culture  supernatant  for  30  minutes  at  4°C.   The  samples  were 
washed  3  times  with  IX  PBS  and  incubated  in  a  800  nl   volume 
of  a  1/500  dilution  of  FITC  (Accurate  Chemical  and 
Scientific  Corp.,  Westbury,  NY)  in  IX  PBS,  0.1%  NaN3  f or  30 
minutes  at  4°C.   The  samples  were  again  washed  3  times  with 
IX  PBS  and  brought  up  in  a  400  jil  volume  for  flow  cytometry. 
The  cells  were  passed  through  a  4  micron  nylon  mesh  filter 
and  analyzed  on  a  FACSTAR  fluorescence  activated  cell  sorter 
(Becton-Dickinson,  Mountain  View,  CA)  at  a  flow  rate  of  300 
cells/second. 

Data  Analysis 

The  DNA  sequence  was  analyzed  by  the  following  computer 
programs.   The  nucleotide  alignment  and  amino  acid 
translation  was  achieved  using  Microgenie  (Beckman, 
Fullerton,  CA) .   The  allelic  phylogenies  were  constructed 
using  the  DNAPARS  and  DNACOMP  programs  in  the  PHYLIP  package 

(Felsenstein  1989),  and  the  neighbor- joining  and  UPGMA 
programs  (provided  by  M.  Nei) .   Nucleotide  divergence  and 
diversity  was  calculated  with  the  SYNO  and  SEND  programs 

(Nei  and  Jin  1989) . 


CHAPTER  IV 
RESULTS 


The  Generation  of  Mhc  Class  II  Ab  Gene  Polymorphism  in  Rodents 


The  mechanisms  responsible  for  generating  antigen 
binding  site  polymorphisms  in  rodents  has  been  a  puzzle  to 
immunogeneticists  for  many  years.   In  an  attempt  to  assess 
the  roles  mutational  and  recombinational  processes  play  in 
diversifying  MHC  class  II  genes,  the  nucleotide  seguence  of 
46  alleles  of  Ab  exon  2  (Abl  exon)  was  determined  and  the 
patterns  of  diversification  examined.   This  is  the  most 
polymorphic  exon,  and  it  contains  the  antigen  binding  site. 
These  alleles  were  obtained  from  a  panel  of  rodents 
containing  12  Mus  species  and  sub-species  and  2  species  of 
Rattus ;  thus  providing  alleles  derived  from  species  diverged 
for  increasing  amounts  of  evolutionary  time  up  to  10  million 
years. 
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Animals 

The  DNA  analyzed  in  this  study  was  isolated  from  fresh 
tissues  or  ethanol  preserved  tissues  from  various  species  of 
rodents.   The  standard  laboratory  inbred  mice  were  from  the 
mouse  colony  in  the  Tumor  Biology  Unit  at  the  Department  of 
Pathology  and  Laboratory  Medicine,  University  of  Florida. 
H-2  homozygous  wild  mice,  whose  origins  and  characteristics 
have  been  described  previously  (Wakeland  et  al.  1987) ,  are 
from  our  wild  mouse  colony  located  at  the  Animal  Care 
Facility,  University  of  Florida.   Some  wild-mouse  derived 
strains  were  supplied  by  F.  Bonhomme's  laboratory  in 
Montpellier,  France.   Three  individuals  of  Rattus  rattus 
were  trapped  locally  in  Gainesville,  Florida.   The  strains 
included  in  this  portion  of  the  analysis  are  listed  in  Table 
4-1. 

Nucleotide  Diversity  Within  the  Abl  Exon 

i 

The  polymerase  chain  reaction  coupled  with  DNA 
sequencing  technology  was  employed  to  obtain  the  nucleotide 
sequence  of  46  alleles  of  the  Abl  exon  (27  alleles  sequenced 
by  S.A.B.;  19  sequenced  by  J.X.  She).   These  sequences  were 
combined  with  10  previously  reported  laboratory  mouse  and 
rat  sequences  to  provide  a  data  base  of  56  sequences 
(Malissen  et  al.  1983;  Larhammer  et  al.  1983a;  Eccles  and 
McMaster  1985;  Estess  et  al.  1986;  Acha-Orbea  and  McDevitt 


Table   4-1.      List   of  Ab  Alleles  Analyzed. 
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Allele 


Species 


Strain 


Geographic  Origin 


nod 


Ab° 
At/ 
At/ 
Ab* 

At/ 

Ab* 

Ab' 
Ab' 
Ab" 

Ab' 

HudoAb' 

HudoAb2 

HudoAb* 

HudoAb' 

HudoAb5 

HudoAb" 

MudoAb" 

Ab" 

HudoAb7 

HudoAb^ 

MudoAby 

HudoAb" 

HudoAb'' 

HudoAb'' 

HudoAb'- 

HudoAb'; 

HudoAb'' 


HumuAb'=Abf 

HumuAb2 

HumuAb3 

MumuAb 

HumuAb5 

HymuAbl5=HudoAb* 

HucaAb' 
HucaAb2 


M. 

m. 

domesticus 

C57BI/6 

Lab  inbred 

H. 

m. 

domesticus 

BALB/C 

Lab  inbred 

H. 

m. 

domesticus 

B10.H 

Lab  inbred 

H. 

m. 

domesticus 

B10.BR 

Lab  inbred 

H. 

m. 

domesticus 

C3H.NB 

Lab  inbred 

H. 

m. 

domesticus 

B10.6R 

Lab  inbred 

H. 

m. 

domesticus 

B10.RIII 

Lab  inbred 

H. 

m. 

domesticus 

A.SW 

Lab  inbred 

M. 

m. 

domesticus 

B10.PL 

Lab  inbred 

M. 

m. 

domesticus 

NOD 

Lab  inbred 

M. 

m. 

domesticus 

DR1 

Florida 

M. 

m. 

domesticus 

B10.CAA2 

Hichigan 

M. 

m. 

domesticus 

B10.STC77 

Hichigan 

H. 

m. 

domesticus 

B10.SAA48 

Hichigan 

H. 

m. 

domesticus 

B10.STC90 

Hichigan 

M. 

m. 

domesticus 

B10.BUA16 

Hichigan 

H. 

m. 

domesticus 

ERFOUD5 

Horocco 

H. 

m. 

domesticus 

AZR0U1 

Horocco 

H. 

m. 

domesticus 

AZR0U3 

Horocco 

H. 

m. 

domesticus 

HETK0VIC2 

Yugoslavia 

H. 

m. 

domesticus 

ERFOUD1 
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1987;  Figueroa  et  al.  1988).   By  aligning  all  the  sequences 
in  our  panel  it  was  observed  that  the  two  codon  deletion  at 
amino  acid  positions  65  and  67  were  erroneously  placed 
previously  in  the  literature  (Estess  et  al.  1986) .   Shifting 
the  position  of  the  two  deletions  to  nucleotide  positions 
175-177  and  185-187  results  in  two  less  nucleotide  mis- 
matches between  the  two  forms  of  Abl.   The  nucleotide 
sequence  of  all  the  alleles  is  shown  in  Figure  4-1. 

DNA  sequence  analysis  of  these  56  sequences  revealed  52 
different  alleles  in  the  data  set;  4  pairs  of  identical 
alleles  were  found  in  independent  samples  from  the  Mus 
musculus  complex.   The  nucleotide  diversity  between  alleles 
was  computed  using  Nei  and  Jin's  program  (1989).   Most 
allelic  comparisons  revealed  5-15%  sequence  diversity  in  the 
Abl  exon;  although  some  alleles  differed  by  as  much  as  25% 
in  comparisons  both  within  and  between  mouse  species.   The 
maximum  value  of  sequence  divergence  was  32.7%,  and  occurred 
between  a  rat  allele  (Rara  )  and  a  mouse  allele  (MudoAb5) . 
The  mean  nucleotide  sequence  diversity  among  alleles  within 
a  Mus  species  (D  =  7.57  ±  0.7%)  was  comparable  to  that 
observed  between  species  (D  =  7.73  ±  0.3%)  and  between  all 
alleles  in  the  genus  (D  =  7.68  ±  0.3%).   These  results 
indicate  that  the  diversification  of  MHC  genes  is 
independent  of  the  phylogenetic  relationships  within  the 
genus  Mus,  and  contrasts  diversification  patterns  of  other 
nuclear  genes  (She  et  al.  1989) . 
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Phyloqerietic  Relationships  of  the  Abl  Alleles 

The  phylogenetic  relationships  of  these  Abl  alleles 
were  analyzed  using  both  phynetic  or  distance  methods,  such 
as  neighbor- joining  and  UPGMA  (Saitou  and  Nei  1987;  Sokal 
and  Sneath  1963) ,  and  parsimony  analysis  (DNAPARS  program  in 
PHYLLIP)4Felsenstein  1989) .   The  distance  methods  determine 
the  allsllic  relationships  by  comparing  the  total  sequence 
divergence  between  alleles;  whereas  parsimony  analysis  forms 
a  networJc  basing  the  genealogy  on  the  fewest  number  of 
mutatioftS= between  alleles.   Similar  results  were  obtained 
for  all  three  methods  of  analysis,  and  Figure  4-2 
illustrates  the  allelic  genealogy  produced  using  UPGMA  based 
on  the  JE**C  distance.   The  results  show  that  alleles  in 
separate  species  are  commonly  more  related  than  alleles 
within  the  same  species.   This  observation  is  consistent 
with  the ^retention  of  ancestral  polymorphisms.   However, 
these  analyses  revealed  very  few  tightly-clustered  allelic 
lineages  that  were  stably  maintained  over  evolutionary  time 
spans.   There  are  only  6  lineages  of  closely  related 
alleles,  or  alleles  with  less  than  2%  nucleotide  sequence 
divergence.   These  lineages  were  strictly  comprised  of 
alleles  derived  from  the  same  or  closely  related  species, 
with  divergence  times  of  less  than  1-2  million  years.   Most 
of  the  Abl  alleles  could  not  be  organized  into  homogeneous 
lineages.   For  example,  if  lineages  were  defined  as  alleles 
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81 
containing  less  than  5%  nucleotide  divergence  in  the  Abl 
exon,  25  separate  singleton  alleles  remained.   This 
indicates  that  most  of  the  lineages  defined  in  the  analysis 
are  not  retained  as  stable  polymorphisms  for  more  than  1-2 
million  years.   This  observation  sharply  contrasts  the 
results  reported  by  Gyllensten  and  Erlich  (1989)  for 
polymorphisms  of  the  primate  class  II  DQa  gene. 

Parsimony  analysis  revealed  a  second  interesting 
feature  of  the  diversification  patterns  among  these  alleles. 
The  most  parsimonious  network  for  all  52  alleles  required 
405  character  changes  and  only  20  of  the  82  (24.2%) 
informative  sites  (nucleotide  positions  exhibiting  at  least 
2  character  states  represented  by  2  or  more  alleles  in  the 
data  set)  were  compatible  with  the  genealogy.   These  results 
indicate  that  the  amount  of  homoplasy,  or  reverse,  parallel, 
or  convergent  mutations,  is  excessive,  and  suggests  that  the 
observed  evolutionary  relationships  are  not  reliable.   An 
example  of  the  homoplasy  in  the  data  set  is  illustrated  in 
Figure  4-3.   The  actual  analysis  was  done  with  nucleotide 
sequences,  but  the  presentation  of  the  results  is  simplified 
by  showing  the  protein  sequence.   The  carboxyl  terminal 
regions  of  the  Ab^  and  MumuAb  alleles  (underlined)  are 
identical  to  that  of  Ab    although  the  remainder  of  their 
sequences  are  clearly  different.   These  results  might  be 
explained  by  convergent  evolution,  but  the  amount  and  the 
patterns  of  homoplasy  observed  are  best  explained  by 
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postulating  intra-exonic  recombinational  events  among  the 
alleles. 

The  Abl  Exon  Consists  of  Five  Polymorphic  Sub-Domains 

The  majority  of  polymorphisms  in  the  Abl  exon  occurs  in 
5  specific  regions,  termed  polymorphic  segments.   These 
segments  contain  56  of  the  82  (68.3%)  informative  sites  for 
parsimony  analysis,  and  are  identified  in  Figure  4-4.   Each 
polymorphic  segments  encodes  a  specific  element  of  the 
hypothetical  class  II  antigen  binding  site,  as  shown  in 
Figure  4-5.   The  BS1,  BS2,  and  BS3  segments  are  located 
within  the  region  of  the  Abl  exon  that  encodes  the  /3-pleated 
sheets  of  the  antigen  binding  site,  the  a-helix  segment  is 
located  in  the  51  end  of  the  region  that  encodes  the  a-helix 
of  the  ABS,  and  the  3'  segment  is  located  at  the  end  of  the 
Abl  exon  in  a  region  that  encodes  a  portion  of  Ab  whose 
structure  cannot  be  currently  predicted.   The  Abl  exon  was 
divided  into  5  sub-domains  based  on  the  locations  of  these 
polymorphic  segments,  and  each  sub-domain  was  analyzed 
separately  by  parsimony  analysis.   This  revealed  that  the 
alleles  in  each  sub-domain  could  be  organized  into  a  series 
of  highly  divergent  lineages.   Furthermore,  the  total  number 
of  mutations  needed  to  produce  all  of  the  lineages  in  all  of 
the  sub-domains  was  two-fold  lower  than  that  required  for 
lineages  constructed  from  the  entire  exon.   This  indicates 
that  these  sub-domain  lineages  have  much  lower  levels  of 
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homoplasy,  and  suggests  that  each  polymorphic  segment  is 
evolving  independently. 

The  Abl  Exon  Sub-Domain  Lineages  Represent  Ancestral 
Polymorphisms 

Parsimony  analysis  identified  5-11  distinct  lineages  in 

each  sub-domain;  primarily  defined  by  point  mutations 

occurring  in  the  polymorphic  segments.  The  consensus 

sequences  of  these  lineages  are  presented  in  Table  4-2. 

These  highly  diversified  polymorphic  segments  often  differ 

in  20-35%  of  their,  nucleotide  sequences.   The  majority  of 

the  diversity  between  polymorphic  segments  appears  to  result 

from  the  accumulation  of  point  mutations  over  long 

evolutionary  periods.   As  illustrated  in  Figure  4-2,  each 

sub-domain  lineage  contains  alleles  derived  from  multiple 

mouse  species,  or  even  both  mouse  and  rat.   The  data  in 

Table  4-3  illustrates  that  alleles  in  the  same  sub-domain 

lineage  often  have  identical  or  very  similar  nucleotide 

sequences,  yet  may  be  derived  from  evolutionary  distant 

rodent  species.   For  example,  some  polymorphic  segments  are 

identical  in  alleles  derived  from  mouse  and  rat,  indicating 

they  have  been  retained  as  polymorphisms  for  a  minimum  of  10 

million  years.   These  results  indicate  that  the  polymorphic 

segments  in  the  sub-domains  of  the  Abl  exon  are  extremely 

stable  polymorphisms,  some  of  which  first  arose  prior  to  the 

divergence  of  mice  and  rats. 
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AB  sub- 
domains 


Line- 
ages 


Nucleotide( 
Sequences 


Number  of 
Alleles'' 


Species 


BS1 


BS2 


BS3 


o-helix 


1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


1 
2 
3 
4 
5 
6 
7 


1 
2 

3 
4 
5 
6 

7 


13       23 
TACCAGTTCAAGGGCGAG 

C--CC-TTC 

GC G 

G CT- 

GT 


61       71      80 
CGGCTCGTGACCAGATACATC 

--ATCT A 

--ATCT---GA 

--ATAT 

A--TATC--GA T-- 

T .---- 

T 

AT G---T 

---AGT---GA CG-T-- 

--TA-A 

97  107 
TACGTGCGCTTC 
-GG 

---C A- 

A- 

-T-A 

C A- 

155      165      175      185 
CCAGACGCCGAGTACTGGAACAGCCAGCCGGAGATC 
AC--T- ***--- a-T-***- 

--CTCA T.***..-A-T-***. 

--CACA 

-GG T-AA T-- 

--GTGG CG AA 

--G T A 


14 

1,2,3,6,7,9,11 

8 

1,2,6 

5 

12,13 

3 

1.6 

23 

1,2,3,4,5,6,8,10 

12 

1,2,3.8 

6 

1,2.4,6 

5 

1,2.3,9 

2 

12 

11 

1,2,4,6 

4 

5,6,7 

4 

1.3 

1 

11 

1 

13 

1 

10 

13 

1.2,4,5,6,8.10 

13 

1.2,4.6.7,10 

4 

1 

4 

1.2.3 

7 

1,2,3,11,13 

4 

1,3,6 

2 

12 

29 

1,2,3,4,5,6,7,8.9,10 

14 

1.2,4,10,11 

4 

1.2,13 

2 

2 

2 

12 

1 

13 

1 

12 

3' segment 


1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 


242      252      262      272 

GGGCCGGAGACCCACACCTCCCTGCGGCGGCTT 

-A-A GT--C 

---GT 

AA-A C 

T C ***-- 

T C 

G-T-T---T 

T-AT T 

*****T T--T--T A C- 

---A CG A 

---GT G ***-- 


16 

1,2.3.4,6,11 

10 

1.2.12.13 

9 

1.2,4.7 

6 

1.2.3 

3 

1.6 

2 

5,6 

3 

12,13 

1 

8 

1 

10 

1 

10 
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Table   4-3 


Sub-domain     Lineage  Alleles 


Sub-domain  sequences 


MustAb1 

MudoAb9 

MusiAb1 

MucoAb1 

Abu 

MuspAb2 

MumuAb2 

MuplAb' 


GCATTTCGTGTACCAGTTCAAGGGCGAGTGCTACTTC 

C — CC-TTC A- 

C — CC-TTC A- 

GC C — CC-TTC A- 

GT G TC-TTC A- 

T — GT C — CC-TTC 

c c-TTC 

C — CC-TTC 

-G C C-T CC-TTC 


BS1 


MudoAb2 
MumuAb1 
MusiAb6 


-GC- 
-GC- 
-GC- 


-G 

-G 

-G 


Rara 
RT-1" 


-G 

-G 


— CT- 
CT- 


A- 

A- 


Abd 
MusiAb3 


-GT- 
-GT- 


A- 

A- 


a-helix 


MustAb1 

MucrAb1 
MucaAb2 
MumuAb4 
MusiAb6 
MusiAb4 
MucoAb1 
MuspAb3 
MumoAb1 
MuceAb1 
MuspAb1 


GCCAGACGCCGAGTACTGGAACAGCCAGCCGGAGATC 

-T AC — T-*** a-T-***- 

-T AC — T-*** a-T-***- 

AC — T-*** a-T-***- 

C-A TC — T-*** a-T-***- 

C-A — TC — T-*** a-T-***- 

-T TC — T-*** A-T-***- 

TC — T— *** A— T— ***— 

A C — T-*** a-T-***- 

T T— *** a— T— ***— 

T-*** a-T-***- 

T T-*** A-T-***- 


MudoAb5 
Ab1535" 
RT-lb 
MumuAb3 


-ACTCA 

-ACTCA 

--CTCA 

— CACA 


T-*** — 

-AC — T-***  — 

T-*** — 

-TC — T-*** — 


_A-T-***- 
-A-T-***- 
-A-T-***- 
-A-T-***- 


MucrAb2  

MudoAb2  c 

MucaAb1  C- 

MuplAb1  t A 
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Amplification  of  Ab  Diversity  bv  Intra-Exonic  Kecombination 

The  specific  association  of  the  polymorphic  segment 
lineages  in  the  various  sub-domains  comprising  the  Abl  exon 
occurs  in  numerous  combinations.   This  indicates  that  intra- 
exonic  recombination  has  occurred  during  the  diversification 
of  the  exon,  and  is  illustrated  in  Figure  4-6.   For  example, 
the  MumuAb5,  MucrAb1,  and  MuceAb1  alleles  all  have  sequences 
from  lineage  5  of  the  BS1  sub-domain,  but  have  sequences  in 
lineages  3,  9,  and  1  respectively  for  the  BS2  sub-domain 
(Figure  4-6A) .   Similar  random  associations  are  found  for 
all  the  lineages  in  all  the  sub-domains,  indicating  that 
shuffling  the  polymorphic  segments  in  each  sub-domain  to 
generate  novel  new  alleles  has  been  a  primary  evolutionary 
mechanism  for  Abl  exon  diversification.   Alleles  derived 
from  phylogenetically  distant  species  commojily  only  share 
one  or  two  polymorphic  segments,  indicating  that  several 
intra-exonic  recombinational  events  have  accumulated  during 
their  evolutionary  divergence.   This  shuffling  process 
accounts  for  the  formation  of  an  array  of  highly  divergent 
alleles,  which  may  differ  in  greater  than  10%  of  their 
nucleotide  sequence  in  the  Abl  exon. 

Despite  this  extensive  recombination,  stable 
associations  of  Abl  sub-domain  lineages  could  be  found  among 
alleles  derived  from  the  same  species  or  sub-species,  or 
closely  related  species.   Examples  of  such  alleles  are  shown 


95 
in  Figure  4-6B,  and  is  consistent  with  the  finding  of 
closely  related  clusters  of  alleles  defined  by  UPGMA 
analysis  in  Figure  4-2.   Analysis  of  the  entire  data  set 
revealed  24  of  the  56  seguences  could  be  organized  into  6 
groups  on  the  basis  of  sharing  4  or  more  contiguous  sub- 
domain  lineages.   All  of  these  groups  only  contained  alleles 
derived  from  the  same  species  of  species  that  have  diverged 
less  than  1-2  million  years.   These  results  suggest  that 
many  sub-domain  lineage  combinations  are  stable  over 
relatively  short  evolutionary  time  spans,  and  that  intra - 
exonic  recombinational  events  in  the  Abl  exon  accumulate 
slowly  over  evolutionary  periods. 

Although  intra-exonic  recombinational  events  act  to 
shuffle  the  polymorphic  segments,  this  does  not  imply  that 
the  recombinational  breakpoints  only  occur  at  specific  sites 
in  the  Abl  exon.   Rather,  these  events  occur  throughout  the 
exon  with  no  obvious  pattern.   This  is  illustrated  by  the 
distribution  of  the  phenotypically  silent  "A"  to  "T" 
transversion  substitution  at  nucleotide  position  37  among 
lineages  of  the  BS1  sub-domain.   The  "A"  nucleotide  is 
present  in  all  alleles  from  lineages  3  and  4;  the  "T" 
nucleotide  is  present  in  all  lineage  2  alleles.   The  "A"  and 
the  "T"  is  a  polymorphism  in  lineage  1  alleles  (Table  4-3) . 
This  distribution  pattern  is  most  easily  explained  by 
postulating  that  the  recombinational  breakpoints  have 
occurred  on  both  sides  of  nucleotide  position  37  during  the 
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evolution  of  the  exon.   Such  a  process  would  result  in  the 
shuffling  of  this  silent  polymorphism  among  lineages  of  Abl. 
Many  other  examples  of  wandering  single  nucleotide 
polymorphisms  can  be  found  throughout  the  exon  (ie. 
positions  14,  106,  126,  210).   In  addition,  several 
recombinational  events  have  occurred  within  a  polymorphic 
segment.   For  example,  the  nucleotides  CACA  at  positions 
155-158  in  the  a-helix  is  found  in  alleles  with  the  two 

5 

codon  deletions  (MudoAb  )  ,  as  well  as  in  undeleted  alleles 

2 

(MumuAb  ) .  These  results,  therefore  indicate  that 

recombinational  breakpoints  do  not  repeatedly  occur  at  the 
same  positions  within  the  exon. 

In  conclusion,  the  data  indicates  that  specific 
sections  of  the  Abl  exon,  or  the  polymorphic  segments, 
contain  many  highly  divergent  allelic  forms  that  are  stable 
over  long  evolutionary  periods.   The  entire  exon;  however, 
does  not  evolve  as  a  single  entity  due  to  repeated  intra- 
exonic  recombinations.   As  a  result,  the  extensive  diversity 
between  alleles  predominantly  arises  from  the 
recombinational  shuffling  of  these  highly  divergent 
polymorphic  segments  into  a  variety  of  novel  combinations. 
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The  Impact  of  Mhc  Class  II  Ab  Gene  Polymorphisms  on  the  Structure 

of  the  Antigen  Binding  Site 


In  an  effort  to  determine  whether  the  extensive 
diversity  of  the  Mhc  Ab  gene  was  generated  and  maintained 
via  selective  pressures  acting  to  diversify  the  antigen 
binding  site,  the  impact  of  the  polymorphism  on  the 
structure  of  the  antigen  binding  site  was  analyzed.   The 
hypothetical  3-dimensional  model  of  Mhc  class  II  molecules 
proposed  by  Brown  et  al.  (1988)  provides  a  framework  in 
which  to  assess  the  potential  impact  of  these  polymorphisms 
on  the  functional  properties  of  Ab.   Forty-eight  different 
Mus-derived  alleles  were  examined  in  this  portion  of  the 
study,  and  the  alleles  are  listed  in  Table  4-4.   The  antigen 
binding  site  was  divided  into  6  segments  based  on  the 
hypothetical  model  (Brown  et  al.  1988),  structural  features, 
and  patterns  of  polymorphism.   /3-pleated  sheets  1,  2,  and  3 
(BS1,  BS2,  and  BS3)  are  encoded  by  amino  acids  9-14,  26-31, 
and  37-40  respectively.   a-helix  1  extends  from  amino  acid 
56  to  amino  acid  position  65.   29.2%  of  the  alleles  contain 
a  proline  at  position  65,  and  68.8%  of  the  alleles  have  a 
codon  deletion  at  this  position.   The  a-helix  2  portion  of 
the  antigen  binding  site  extends  from  amino  acid  position  66 
to  79.   Every  allele  seguenced  encodes  cysteine  at  positions 
15  and  79,  and  these  residues  form  a  disulfide  bond  that 
probably  acts  to  anchor  the  a-helix  to  the  /3-pleated  sheet 


Table   4-4.      List   of  Ab  Alleles  Analyzed. 
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Allele 


Species 


Strain 


Geographic  Origin 


Ab" 
At/ 
UJ 

Atf 

Ab* 

Mf 

Ab< 

Ab' 

Ab' 

Ab" 

Ab"^ 

HudoAb7 

MudoAb2 

MudoAb3 

HudoAb'' 

HudoAbJ 

HudoAb" 

HudoAb" 

Ab" 

HudoAb7 

HudoAb* 

HudoAb9 

HudoAb' 

HudoAb' 

HudoAb 


HudoAb' 


HudoAb 
HudoAb 


HumuAbi=Abp 

HumuAb2 

HumuAb 

HumuAb 

HumuAb 

HumuAb<5=HudoAb* 


MycaAb' 
MucaAb2 


H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

M. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

M. 

m. 

domes t i  cus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

M. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

M. 

m. 

domesticus 

M. 

m. 

domesticus 

M. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

domesticus 

H. 

m. 

musculus 

H. 

m. 

musculus 

H. 

m. 

musculus 

H. 

m. 

musculus 

M. 

m. 

musculus 

H. 

m. 

musculus 

H. 

m. 

castaneus 

H. 

m. 

castaneus 

C57BI/6 

Lab  inbred 

BALB/C 

Lab  inbred 

B10.M 

Lab  inbred 

B10.UB 

Lab  inbred 

B10.BR 

Lab  inbred 

C3H.NB 

Lab  inbred 

B10.6R 

Lab  inbred 

B10.RIII 

Lab  inbred 

A.SU 

Lab  inbred 

B10.PL 

Lab  inbred 

NOD 

Lab  inbred 

DR1 

Florida 

B10.CAA2 

Michigan 

B10.STC77 

Michigan 

B10.SAA48 

Michigan 

B10.STC90 

Michigan 

B10.BUA16 

Michigan 

ERF0U05 

Morocco 

AZROU1 

Morocco 

AZROU3 

Horocco 

METK0VIC2 

Yugoslavia 

ERFOUD1 

Morocco 

FAYIUH4 

Egypt 

FAYIUH5 

Egypt 

JERUSALEH4 

Israel 

24CI 

Italy 

38CH 

Italy 

METKOVIC1 

Yugoslavia 

VIBOURG7 

Denmark 

HBS 

Bulgaria 

HDS 

Denmark 

HBT 

Bulgaria 

MYL 

Yugoslavia 

BRN04 

Czechoslovakia 

CAS 

Thailand 

T HON BUR 11 

Thailand 

MumoAb 


HuspAb 
HuspAb2 
HuspAb-* 

HusiAb7 
HusiAb2 
HusiAb* 
HusiAb' 
HusiAb5 
MusiAb" 


MustAb' 


MucoAb 


MuceAb 


MucrAb' 
MucrAb2 


M.  m.  molossinus  MOL  Japan 

M.  spretus  SEG  Spain 

M.  spretus  SEI  Spain 

M.  spretus  STF  Tunisia 

M.  spicilegus  PANSEV01  Yugoslavia 

M.  spicilegus  PANSEV02  Yugoslavia 

M.  spicilegus  PANSEVOB  Yugoslavia 

M.  spicilegus  ZRU  Bulgaria 

M.  spicilegus  ZYD  Yugoslavia 

M.  spicilegus  ZBN  Bulgaria 

M.  spretoides  XBJ  Bulgaria 

M.  cookii  COK  Thailand 

M.  cervicolor  popaeus    CRP  Thailand 

M.  carol i  KAR  Thailand 

M.  carol i  KAR2  Thailand 


HuptAb 


M.  platythrix 


PTX 


India 


' 
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structure.   The  a-helix  3  section  extends  from  amino  acid  80 
to  amino  acid  89.   The  most  common  amino  acid  at  position  89 
is  proline,  and  in  contrast  to  the  proline  at  residue  65 
which  most  likely  kinks  the  a-helix,  the  proline  at  position 
89  probably  destroys  the  secondary  a-helical  structure. 

The  Distribution  of  Amino  Acid  Diversity 

The  amino  acid  sequences  predicted  from  the  nucleotide 
sequences  of  48  Mus-derived  alleles  of  Ab  exon  2  are 
presented  in  Figure  4-7.   The  alleles  in  the  panel  differ  by 
an  average  of  8.6  amino  acids,  and  most  of  this  diversity  is 
localized  within  discrete  segments  of  the  exon.   This  is 
illustrated  by  analysis  of  the  protein  sequence  variability 
by  the  convention  of  Kabat  and  Wu  (Wu  and  Kabat  1970) (Figure 
4-8,  and  Table  4-5).   There  are  eleven  positions  with  high 
variability  (V  >  7.0),  and  ten  positions  with  moderate 
variability  (4.0  <  V  <  5.5),  and  all  these  residues  are 
located  in  segments  that  comprise  distinct  portions  of  the 
antigen  binding  site  (Figure  4-9) .   These  antigen  binding 
site  segments  are  the  /3-pleated  sheets  that  form  the  floor 
of  the  antigen  binding  site,  and  the  a-helix  that  forms  the 
side  of  the  binding  site  groove.   It  should  be  noted  that 
the  amino  acid  residues  predicted  by  Brown  et  al.  (1988)  to 
directly  contact  antigenic  fragments  (marked  by  arrows  in 
Figure  4-8)  represent  only  6  of  the  11  most  polymorphic 
residues.   This  suggests  that  amino  acid  positions  other 
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Kabat  and  Wu  variability  data  for  Ab  exon  2 
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Amino  Acid             Kabat   and  Wu  Value 

Position                        48  Alleles 

6) 

2.04 

51) 

7) 

2.04 

52) 

8) 

2.04 

53) 

9) 

10.92 

54) 

10 

)                                  1.00 

55) 

11 

)                                  2.82 

56) 

12 

)                                  4.36 

57) 

13 

)                                  7.74 

58) 

14 

)                                  8.89 

59) 

15 

)                                  1.00 

60) 

16 

)                                  1.00 

61) 

17 

)                                  4.00 

62) 

18 

)                                  1.00 

63) 

19 

)                                  1.00 

64) 

20 

)                                  1.00 

65) 

21 

)                                  1.00 

66) 

22 

)                                  1.00 

67) 

23 

)                                  2.04 

68) 

24 

)                                  1.00 

69) 

25 

)                                  1.00 

70) 

26 

1                                  16.70 

71) 

27 

2.04 

72) 

28 

13.10 

73) 

29 

2.04 

74) 

30 

4.68 

75) 

31 

2.04 

76) 

32, 

\                                 1.00 

77) 

33] 

>                                  1.00 

78) 

34] 

1                                  3.13 

79) 

35] 

I                                  1.00 

80) 

36] 

I                                  1.00 

81) 

37] 

1                                 7.38 

82) 

38] 

6.00 

83) 

39] 

1.00 

84) 

40] 

2.46 

85) 

4i; 

1.00 

86) 

42] 

1.00 

87) 

43] 

1.00 

88) 

44; 

3.13 

89) 

45] 

2.18 

90) 

46; 

1.00 

91) 

47; 

3.69 

92) 

48] 

1.00 

93) 

49; 

1.00 

94) 

50; 

2.13 

95) 

2.04 
1.00 
1.00 
1.00 
1.00 
3.79 


7. 

4. 
1. 
2. 


39 
80 
00 
04 
10.00 
1.00 
2.90 
1.00 
4.36 
2.90 
4.36 
1.00 
1.00 
3.69 
3.27 
4.27 
1.00 
5.58 
5.33 
2.04 
2.04 
3.60 
1.00 
1.00 
2.13 
00 
00 
13 
06 
18.45 
3.13 
4.92 
16.00 
2.04 
1.00 
1.00 
2.34 
1.00 
3.13 


1. 
1. 
3. 
7. 
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than  those  predicted  to  be  contact  amino  acids  may  play  a 
prominent  role  in  determining  allelic  binding  capacity. 
As  illustrated  in  Figure  4-9,  the  most  polymorphic 
residues  in  0-strands  2  and  3  do  not  exactly  coincide  with 
the  central  floor  of  the  antigen  binding  site. 
Interestingly,  /3-strand  1  is  separated  from  0-strands  2  and 
3  by  a  /3-turn  segment  whose  exact  structure  cannot  be 
definitively  determined  from  the  class  I  model.   Perhaps  the 
jS-turn  in  this  segment  is  more  abrupt  than  proposed  by  Brown 
et  al.  (1988) .   Consequently,  this  would  have  the  effect  to 
more  precisely  align  the  most  polymorphic  amino  acids  of  fi- 
strands  2  and  3  with  the  inner  floor  of  the  antigen  binding 
site.   Alternatively,  if  /3-strands  2  and  3  are  properly 
aligned  in  the  hypothetical  model,  some  of  the  most 
polymorphic  amino  acids,  such  as  positions  26  and  37,  could 
form  highly  diversified  pockets  upon  interaction  with  the  a- 
helical  residues. 

It  is  interesting  to  note  that  amino  acid  position  57 
is  highly  variable  (V  =  7.39),  with  six  different  amino 
acids  detected  among  the  48  alleles  examined.   Polymorphisms 
of  position  57  were  previously  suggested  to  increase 
susceptibility  to  diabetes  (Todd  et  al.  1988) . 

Forty-three  percent  (38/89)  of  the  amino  acids  that 
comprise  Ab  exon  2  are  conserved  (V  =  1)  among  the  alleles 
in  the  data  set  (Figure  4-8,  and  Table  4-5).   Twelve  of 
these  conserved  residues  are  located  within  the  antigen 
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binding  site;  2  in  the  /?-pleated  sheet  segments,  and  10  are 
scattered  throughout  the  a-helical  region  in  no 
distinguishable  pattern  (Figure  4-9) .   Three  of  these 
conserved  residues  play  critical  roles  in  maintaining  proper 
tertiary  structure  of  the  Abl  domain.   For  instance, 
residues  15  and  79  both  encode  for  cysteine,  and  form  the 
disulfide  bond  stabilizing  the  structure  of  the  molecule. 
In  addition,  the  N-linked  glycosylation  site  at  residue  19 
is  conserved.   Sixty-one  percent  (23/38)  of  the  conserved 
amino  acids  are  located  in  the  /J-pleated  sheets  outside  the 
antigen  binding  site,  or  form  the  /3-turns. 

Overall,  these  results  indicate  that  extensive  amino 
acid  diversity  is  only  permitted  in  parts  of  the  molecule 
that  can  alter  antigen  binding  capacity,  such  as  the  antigen 
binding  site.   In  contrast,  a  certain  amount  of  amino  acid 
conservation  is  required  to  maintain  the  structural 
integrity  of  the  Abl  domain. 

Selective  Diversification  of  Antigen  Binding  Site  Codons 

To  investigate  the  possibility  that  diversifying 
selection  was  involved  in  the  development  of  antigen  binding 
site  variability,  the  frequencies  of  nonsynonymous  (dn, 
amino  acid  replacement)  and  synonymous  (ds, silent) 
substitutions  was  measured  as  described  by  Hughes  and  Nei 
(1988) ,  and  performed  on  the  alleles  of  our  data  set.   As 
shown  in  Table  4-6,  any  two  Ab  exon  2  sequences  in  the  data 
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set  differ  on  average  by  9.7%  nonsynonymous  substitutions. 
This  is  in  contrast  to  the  average  percentage  of  synonymous 
changes  between  alleles,  which  is  3.9%.   Since  the  majority 
of  substitutions  in  eukaryotic  and  prokaryotic  genes  are 
synonymous,  the  ratio  of  nonsynonymous  to  synonymous 
substitutions  (dn/ds)can  be  used  as  a  measurement  of  the 
degree  of  diversifying  selection.   These  data  provide 
evidence  that  Ab  exon  2  is  selectively  diversified  (dn/ds= 
2.5).   As  illustrated  in  Table  4-6  and  Figure  4-10  the 
codons  forming  the  antigen  binding  site  are  experiencing 
more  nonsynonymous  substitutions  and  fewer  synonymous 
substitutions  (dn/ds=  2.8)  than  non-antigen  binding  site 
codons  in  exon  2  or  any  codons  in  exon  3  (dn/ds=  0.6  and 
0.15,  respectively).   Analysis  of  each  individual  antigen 
binding  site  segment  separately  shows,  with  the  exception  of 
the  a-helix  1  portion  which  has  a  very  high  freguency  of 
synonymous  mutations,  a  much  higher  freguency  of 
nonsynonymous  substitutions  than  synonymous  substitutions. 
This  pattern  is  particularly  apparent  for  the  codons 
predicted  to  encode  amino  acids  that  contact  antigenic 
fragments  directly  (dn/ds=  22.1) (Table  4-6) (Brown  et  al. 
1988) .   These  results  strongly  support  the  conclusion  that 
diversifying  selection  is  predominantly  responsible  for 
antigen  binding  site  diversity. 
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Codon  Deletions  of  the  Abl  Domain 

The  a-helix  and  carboxyl  terminus  segment  of  the  Abl 
domain  occurs  in  5  different  lengths  among  the  48  Ab  alleles 
analyzed.   In  addition  to  the  undeleted  and  65/67  deletion 
forms  found  in  laboratory  mouse  strains  (Choi  et  al.  1983; 
Estess  et  al.  1988) ,  alleles  with  codon  deletions  of 
positions  65/67/72,  65/67/88-90,  and  65/67/93  were  detected 
(Figures  4-7  and  4-11) .   The  impact  of  these  various  codon 
deletions  on  the  structure  of  the  antigen  binding  site  may 
not  be  equivalent.   For  instance,  the  deletions  of  amino 

acids  65,  67,  and  72  (the  deletion  of  amino  acid  72  was 

5 
found  in  only  allele,  MudoAb  )  occur  in  the  51  section  of 

the  a-helix,  whereas  the  codon  deletions  of  amino  acids  88- 

90,  and  93  occur  3'  the  stabilizing  disulfide  bond,  in  the 

segment  connecting  the  Abl  and  Ab2  domains. 

The  codon  deletions  of  amino  acids  65,  67,  and  72  would 

be  predicted  to  change  the  orientation  of  amino  acids  in  the 

a-helix  segments  adjacent  to  the  deletions  relative  to  the 

undeleted  alleles.   Consequently,  these  codon  deletions 

should  change  the  positions  of  some  contact  amino  acids  with 

R  groups  orientated  into  the  antigen  binding  site,  thereby 

profoundly  modifying  the  interior  structure  of  at  least  a 

portion  of  the  binding  site.   In  addition,  all  the  alleles 

that  do  not  delete  amino  acids  65  and  67,  with  the  exception 

of  the  MuplAb  allele,  have  a  proline  residue  at  position 

65.   This  would  most  likely  introduce  a  kink  into  the  a- 
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helix  structure.   Thus,  the  two  forms  of  a-helices  have  very 
different  structures,  and  most  likely  have  some  different 
amino  acid  side  chains  facing  into  the  interior  of  the 
antigen  binding  site.   To  test  the  possibility  that  the 
patterns  of  amino  acid  variability  might  change  due  to 
strong  diversifying  selection  acting  on  the  contact  amino 
acids  (Table  4-6) ,  the  variability  of  a-helix  segments  1  and 

2  from  undeleted  and  65/67  deleted  alleles  were  measured 
separately.   As  shown  in  Figure  4-12  and  Table  4-7,  5 
variable  positions  are  unique  to  the  undeleted  alleles,  and 

3  variable  positions  are  unique  to  the  65/67  deleted 
alleles.   The  most  striking  difference  between  the  two 
alleles  is  at  position  61  of  the  deleted  alleles.   The  high 
variability  of  position  61  suggests  that  the  codon  deletions 
rotate  the  a-helix  in  such  a  manner  that  the  side  chain  of 
amino  acid  61  becomes  a  contact  amino  acid,  and,  therefore 
undergoes  strong  diversifying  selection.   Furthermore,  amino 
acid  56  is  also  uniquely  diversified  in  the  65/67  deleted 
alleles,  additionally  suggesting  that  the  deletions  change 
the  orientation  of  the  a-helix  further  51.   Overall,  these 
results  indicate  that  diversifying  selection  operates  on 
different  amino  acid  positions  in  these  alleles  of  the  a- 
helix.   This  suggests  that  the  codon  deletions  in  the  51 
portion  of  the  a-helix  may  change  the  position  of  some 
contact  amino  acids,  and  consequently  dramatically  effect 


H 

r-~ 

^3 

W 

-p  -d 

c 

C 

Q) 

fO 

X 

g, 

-H 
If)  rH 

Q) 

VD 

a) 

Ifl 

n 

x: 

i 

X 

c 

« 

-H 

o 

iH 

•H  X! 

Q)  -p 

0 

£ 

•H 

id 

1 

ia 

a) 

a 

0 

ft    ^ 

a> 

o 

x;  p 

UH 

-p 

(0 

■d 

<W  T 

<u 

O 

a) 

■H 

p  4H 

-P 

a) 

-H 

0  r-i 

w 

rH 

o 

M 

&T3 

a) 

> 

>1  W 

•H 

■P 

0)  T) 

•H 

rH 

iH 

0) 

>, 

•H 

t-i 

rH 

Xi    rH 

a) 

ifl 

ti 

a 

•H 

& 

*H  T5 

•H 

m 

C 

c 

> 

(0 

3 

3 

Ul 

Ifl 

5 

CD 

C 

r-i 

0 

T3 

<D 

•H 

CH+J 

«J  rH 

•H 

(d 

Ifl 

P 

0 

(0  Tf 

ft 

XI 

<D 

(0  P  P 

« 

Q) 

c 

rH 

(1) 

0) 

Ifl 

• 

T3 

QJ 

(N 

C 

h 

H 

3 

0) 

■<*• 

4H 

u 

0 

• 

0 

W    Q) 

^1 

<N 

£    rH 

3 

0   0) 

D>TJ 

U   r-{ 

•H 

c 

U   r-< 

fa 

<0  <   (0 

120 


AimavidVA 


121 
Table  4-7 


Kabat  and  Wu  variability  data  for  undeleted  and  amino  acid 
position  65,  and  67  deleted  alleles. 


Kabat 

and  Wu  Value 

Amino  Acid 

Undeleted 

65.  67  Deleted 

Position 

Alleles  (15) 

Alleles  (33) 

56 

1.00 

4.30* 

57 

3.46 

6.35 

58 

2.14 

3.81 

59 

1.00 

1.00 

60 

2.14* 

1.00 

61 

1.00 

9.43* 

62 

1.00 

1.00 

63 

1.00 

1.00 

64 

1.00 

1.00 

65 

2.14* 

1.00 

66 

1.00 

1.00 

67 

2.14* 

1.00 

68 

1.00 

1.00 

69 

1.00 

1.00 

70 

2.14 

3.96 

71 

3.46 

2.13 

72 

1.00 

3.30* 

73 

1.00 

1.00 

74 

3.46 

3.30 

75 

2.99 

4.50 

76 

2.14* 

1.00 

77 

2.14* 

1.00 

78 

2.50 

3.54 

79 

1.00 

1.00 

*  -  denotes  uniquely  diversified  position  for  that  a-helix 
allele. 
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binding  properties  of  the  antigen  binding  site  in  a  single 

step. 

The  codon  deletions  at  positions  88-90,  and  93  probably 
do  not  effect  the  a-helical  structure  of  the  antigen  binding 
site.   This  is  predicted  because  these  deletions  occur  31 
the  disulfide  bond  between  amino  acids  15  and  79  which 
presumably  anchors  the  a-helix  to  the  /3 -pleated  sheet.   In 
addition,  80%  (4/5)  of  the  alleles  deleted  at  position  93 
have  a  proline  residue  just  5'  which  would  probably  act  as 
an  a-helix  breaker  in  this  portion  of  the  molecule. 
However,  the  high  frequency  of  alleles  with  deletions  in  the 
carboxyl  terminus  of  the  Abl  domain  (12.5%),  and  the 
presence  of  the  deletion  at  position  93  in  multiple  species 
(K.    m.  domesticus  and  M.  soecilegus)  may  suggest  that 
deletions  in  this  portion  of  the  molecule  do  have  some 
functional  impact.   Perhaps  they  subtly  alter  the  plane  of 
the  antigen  binding  site.   Alternatively,  these  3"  deletions 
may  have  no  functional  value  to  the  molecule  and  may  simply 
be  hitchhiking  mutations. 

It  is  interesting  to  note  that  the  a-helix  3  and 
carboxyl  terminus  segments  of  the  Abl  domain  come  in  three 
basic  forms.   These  forms  are  dependent  upon  the  position 
and/or  presence  of  a  proline  residue.   The  proline  is  found 
either  at  position  86  or  89,  and  most  likely  acts  to  destroy 
the  a-helical  structure  3'  as  the  Abl  domain  extends  toward 
the  Ab2  domain.   On  the  other  hand,  27%  (13/48)  of  the 
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alleles  do  not  have  any  proline  residue  in  this  section  of 
the  molecule,  thereby  allowing  a-helical  secondary  structure 
to  continue  until  the  polypeptide  connects  with  the  Ab2 
domain.   Additionally,  the  a-helix  3  segment  is  extremely 
polymorphic  containing  2  of  the  3  most  diversified  amino 
acid  positions  (86,  and  89)  in  Ab  exon  2  (Figure  4-8,  Table 
4-5) .   Furthermore,  this  segment  has  undergone  strong 
diversifying  selection  (dn/dg=  12.5) (Figure  4-10,  Table  4- 
6) .   These  results  suggest  that  this  portion  of  the  molecule 
can  strongly  influence  antigen  binding  capacity  and  immune 
responsiveness . 

Nonconservative  Amino  Acid  Substitutions  in  the  Antigen 
Binding  Site 

Protein  parsimony  analysis  (Felsenstein  1988;  1989)  was 

used  to  obtain  allelic  lineages  for  each  of  the  6  defined 

antigen  binding  site  segments.   Between  5  and  7  lineages 

were  discerned  at  each  antigen  binding  site  segment.   The 

most  common  motif  was  used  as  the  prototypical  lineage 

motif,  and  was  always  found  in  multiple  alleles  (shown  in 

Tables  4-8  -  4-13) .   Comparisons  of  the  prototypic  motifs 

from  each  lineage  revealed  that  on  average  these  allelic 

forms  differed  by  32.4%  ±  1.7%  of  their  amino  acids. 

Surprisingly,  66.5%  ±  3.6%  of  these  substitutions  result  in 

non-conservative  amino  acid  changes  (ie.  changes  between 

acidic,  basic,  uncharged  polar,  or  non-polar  residues) . 
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Table  4-8 


Allelic  lineages  of  the  /3-pleated  sheet  1  portion  of  the 
antigen  binding  site  containing  amino  acids  9-14. 


Amino  Acid  Motif  Number  of  Alleles 

YQFKGE 
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1 
1 
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Table  4-9 


Allelic  lineages  of  the  ^-pleated  sheet  2  portion  of  the 
antigen  binding  site  containing  amino  acids  26-31. 


Amino  Acid  Motif 

Number  of  Alleles 

LVTRYI 

1) 

12 

c- 

1 

2) 

S-N 

8 

s 

1 

C-N 

1 

S-N-F- 

i\ 

1 

3) 

-_I 

./ 

6 

4) 

S-D 

5 

A-D 

1 

5) 

Y 

3 

H 

1 

I 

1 

6) 

— S-N- 

2 

— A-N- 

1 

— s 

1 

S N- 

1 

Y N- 

1 

GL V 

1 
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Table  4-10 


Allelic  lineages  of  the  /3-pleated  sheet  3  portion  of  the 
antigen  binding  site  containing  amino  acids  37-40. 


Amino  Acid  Motif  Number  of  Alleles 

YVRF 


13 

1 

13 

5 
2 

1 

4 
1 

4 

4 


1) 

W 

W--Y 

2) 

-L — 

3) 

FM — 

F 

FL-- 

4) 



L 

5) 

-L-Y 

6) 

Y 
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Table  4-11 

Allelic  lineages  of  the  a-helix  1  portion  of  the  antigen 
binding  site  containing  amino  acids  56-65. 

Amino  Acid  Motif  Number  of  Alleles 
PDAEYWNKQ* 

1)    S-P  13 

-R S-P  1 

-T — N — S-P  1 

— G S-P  1 

2)  S Y 6 

Y 4 

3)    /  5 

4)  — P — Y 4 

5)    F 2 

-Q— F 2 

S F 2 

-T F 1 

6)  — V 3 

HS 1 

G—  1 

-E C 1 
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Table  4-12 

Allelic  lineages  of  the  a-helix  2  portion  of  the  antigen 
binding  site  containing  amino  acids  66-79. 


Amino  Acid  Motif  Number  of  Alleles 

Y*LERTRAELDTVC 


1)  EI V 10 

EI V — A-  2 

EI VNM —  1 

EI 1 

EI R 1 

EI K — W 1 

2)    13 

* 1 

3)    Q 3 

QK — AV 1 

QK V 1 

Q— QV 1 

Q A-  1 

4)    v 5 

5)    E A-  2 

A-  1 
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Table  4-13 


Allelic  lineages  of  the  a-helix  3  portion  of  the  antigen 
binding  site  containing  amino  acids  80-89. 


5)    L 

Q-L 


Amino  Acid  Motif  Number  of  Alleles 

RHNYEGTETP 


1)    P--H  li 

p — s  1 

-Y P — H  1 

K-P--H  1 

PG-H  1 

S — H  1 

2)    -V--H    (;  8 

V — S  1 

V — R  1 

3)    K 5 

4)    E — V-  6 

-Y E — V-  2 


3 

1 


VL 1 

R  1 

-ILI**  i 

— P-SY  1 
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These  non-conservative  amino  acid  changes  would  individually 
be  expected  to  cause  sub-site  alterations  in  chemical  and 
charge-distribution  characteristics.   This  suggests  that 
antigen  binding  sites  composed  of  different  allelic  segments 
would  possess  highly  divergent  chemical  properties.   This  is 
illustrated  in  Figure  4-13A  and  B.   These  models  distinguish 
the  chemical  classification  of  the  amino  acids  at  each  of 
the  most  variable  residues  in  the  antigen  binding  site  with 
different  fill  patterns.   MudoAb6  and  MudoAb  are  two  M.  m. 
domesticus  alleles  that  share  only  one  allelic  segment  of 
the  antigen  binding  site.   These  Ab  alleles  differ  by  13 
amino  acid  substitutions,  and  10  of  these  substitutions  are 
non-conservative,  indicating  that  they  encode  antigen 
binding  sites  with  fundamentally  different  structural  (and 
presumably  binding)  properties. 

The  Role  of  Intra-Exonic  Recombination  in  Diversifying  the 
Antigen  Binding  Site 

Inter-allelic  "shuffling"  of  discrete  antigen  binding 

site  segments  via  intra-exonic  recombination  would  be 

predicted  to  produce  recombinant  alleles  with  antigen 

binding  site  properties  markedly  different  from  either 

parental  allele.   41  of  48  alleles  in  the  data  set  differ  by 

at  least  one  antigen  binding  site  allelic  segment, 

indicating  that  intra-exonic  recombinational  events  have 

been  involved  in  the  divergence  of  almost  all  the  alleles. 
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An  illustration  of  the  effect  of  this  recombination  on  the 
antigen  binding  site  structures  is  shown  in  Figure  4-13C. 
MudoAb14represents  an  Ab  lineage  that  could  have  been 

produced  by  a  single  intra-exonic  recombination  between 

6  11 

parental  alleles  similar  to  MudoAb  and  MudoAb 

(recombinational  breakpoint  between  BS3  and  a-helix  1) .   As 
a  result  of  this  single  hypothetical  recombinational  event, 
the  antigen  binding  site  of  MudoAb14differs  from  those  of 
either  parent  by  6  amino  acid  substitutions  (4  or  5  non- 
conservative)  .   Thus,  intra-exonic  recombination  between 
alleles  composed  of  highly  divergent  antigen  binding  site 
segments  has  the  ability  to  rapidly  create  alleles  with 
unique  antigen  binding  site  characteristics.   The  extent  of 
diversity  between  allelic  segments  of  the  antigen  binding 
site  coupled  with  the  amount  of  intra-exonic  recombination 
detected  among  the  alleles  in  the  data  set  strongly  suggests 
that  this  mechanism  has  contributed  greatly  to  the  formation 
of  alleles  with  highly  divergent  functional  properties. 

The  data  discussed  in  this  section  support  the  view 
that  there  has  been  positive  selection  for  mutations  that 
create  antigen  binding  site  diversity.   These  mutations  may 
simply  be  point  mutations  in  the  nucleotide  sequence  that 
may  lead  to  a  nonsynonymous  substitution,  and  quite  possibly 
change  the  biochemical  class  of  amino  acid.   Additionally, 
mutations  such  as  codon  deletions  and/or  intra-exonic 
recombination  events  can  rapidly  change  the  structure  and 
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binding  properties  of  the  antigen  binding  site.   All  of 
these  mechanisms  have  interacted  over  evolutionary  time  to 
create  a  diverse  array  of  antigen  binding  sites  in  natural 
mouse  populations. 

Serological  Characterization  of  the  Mhc  Class  II  A  Molecule 

In  an  effort  to  examine  the  structural  polymorphisms  of 
the  Mhc  class  II  A  molecule,  and  attempt  to  map  serological 
epitopes  on  the  Ab  chain,  a  serological  analysis  employing 
monoclonal  antibodies  was  undertaken.   A  panel  of  twelve 
monoclonal  antibodies  were  used  in  the  study.   Five 
monoclonal  antibodies  are  reportedly  specific  for  the  Ab 
chain,  five  monoclonal  antibodies  are  specific  for  the  Aa 
chain,  and  two  monoclonal  antibodies  either  react  with  an 
epitope  determined  by  the  interaction  of  both  chains,  or 
have  not  been  definitively  mapped  to  react  with  either  chain 
(Oi  et  al.  1978;  Pierres  et  al.  1980;  1981;  Kappler  et  al. 
1981;  Braunstein  and  Germain  1987).   Table  4-14  presents  the 
12  monoclonal  antibodies  used  in  this  study,  and  shows  their 
reactivity  patterns  with  laboratory  inbred  strains  of  mice. 

Mice 

All  the  mice  used  in  this  study  were  from  mouse  colony 
in  the  Tumor  Biology  Unit  of  the  Department  of  Pathology  and 
Laboratory  Medicine,  University  of  Florida,  or  from  our  wild 
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Table  4-14 


List  of  Monoclonal  Antibodies  Used  to  Analyze  Structural 
Polymorphisms  of  the  A  Molecule 


MONOCLONAL  ANTIBODY       INBRED  STRAIN  REACTIVITY 


40B  A^Ur'iv 

39H  auu' 

MKD6  ^q.u.r.p^f.v.nod 

25-9-17  Mo,q.P.v 


34-5-3  A£d,"'q,P,v 

10.2.16  AgU,,Ul"' 

4  OF  M 


k.u,f 


3JP  ^.k.q.u.r.p.j.^f.v 

'k 

>,  k.  q,  u,  r,  j,  p,  j,  £,  v 


39  J  Act' 

39F  Aak 

K25-8.7  Aa° 

K24-199  Ao*i^aod 
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mouse  colony,  located  at  the  Animal  Care  Facility, 
University  of  Florida.   The  wild-derived  mouse  strains  are 
homozygous  at  the  H-2  complex,  and  are  maintained  by  sibling 
matings.   The  wild-derived  mice  are  from  2  sub-species  of 
Mus  musculus;  M.  m.  domesticus  and  M.  m.  musculus;  and  are 
originally  from  diverse  geographic  origins.   The  mice 
analyzed  in  this  study  are  listed  in  Table  4-15. 


Serological  Characterization  of  the  A  Molecule 

The  two  A  molecule  specific  monoclonal  antibodies,  40B 
and  39H,  can  distinguish  3  alleles  of  the  A  molecule  (Table 
4-16) .   The  5  Aa  chain  specific  monoclonal  antibodies,  3JP, 
39J,  39F,  K25-8.7,  and  K24-199  can  distinguish  5  different 
Aa  alleles  (Table  4-17) .   In  addition,  the  monoclonal 
antibodies  3JP  and  K25-8.7  appear  to  recognize  the  same  or  a 
similar  determinant  on  the  Aa  chain  of  the  28  strains 
examined,  because  they  have  identical  reactivity  patterns. 
The  5  Ab  chain  specific  monoclonal  antibodies,  10.2.16,  25- 
9-17,  34-5-3,  MKD6,  and  40F  can  distinguish  8  different  Ab 
alleles  (Table  4-18). 

Mapping  of  Ab  Specific  Serological  Epitopes 

Three  Ab  specific  monoclonal  antibodies  apparently  map 
to  the  same  or  similar  epitope.   25-9-17  and  34-5-3  have  the 
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Table  4-15 


List  of  Strains  Used  in  the  Serological  Analysis  of  the  A 

Molecule 


STRAIN 

SUB-SPECIES 

GEOGRAPHICAL  ORIGINS 

C57/B6 

M. 

m. 

dom. 

laboratory 

inbred 

BALB/C 

M. 

m. 

dom. 

laboratory 

inbred 

B10.M 

M. 

m. 

dom. 

laboratory 

inbred 

B10.WB 

M. 

m. 

dom. 

laboratory 

inbred 

BIO. BR 

M. 

m. 

dom. 

laboratory 

inbred 

C3H.NB 

M. 

m. 

dom. 

laboratory 

inbred 

B10.6R 

M. 

m. 

dom. 

laboratory 

inbred 

B10.RIII 

M. 

m. 

dom. 

laboratory 

inbred 

A.SW 

M. 

m. 

dom. 

laboratory 

inbred 

BIO. PL 

M. 

m. 

dom. 

laboratory 

inbred 

B10.SM 

M. 

m. 

dom. 

laboratory 

inbred 

NOD 

M. 

m. 

dom. 

laboratory 

inbred 

NON 

M. 

m. 

dom. 

laboratory 

inbred 

B10.CAA2 

M. 

m. 

dom. 

Michigan 

B10.SAA48 

M. 

m. 

dom. 

Michigan 

BUA16 

M. 

m. 

dom. 

Michigan 

DR1 

M. 

m. 

dom. 

Florida 

JER1 

M. 

m. 

dom. 

Israel 

JER4 

M. 

m. 

dom. 

Israel 

AZROU2 

M. 

m. 

dom. 

Morocco 

AZROU3 

M. 

m. 

dom. 

Morocco 

ERFOUD1 

M. 

m. 

dom. 

Morocco 

ERFOUD5 

M. 

m. 

dom. 

Morocco 

MET1 

M. 

m. 

dom. 

Yugoslavia 

MET2 

M. 

m. 

dom. 

Yugoslavia 

FAYIUM4 

M. 

m. 

dom. 

Egypt 

BRN04 

M. 

m. 

mus. 

Czechoslavakia 

VIBOURG7 

M. 

m. 

mus. 

Denmark 
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Table  4-16 
Reactivity  of  A  Molecule  Specific  Monoclonal  Antibodies 


STRAIN 

MONOCLONAL 

ANTIBODY 

SUB 

H-2 

GEOGRAPHICAL 

40B 

39H 

SPECIES 

ORIGIN 

BALB/C 

+ 

+ 

dom 

d 

inbred 

B6 

+ 

+ 

dom 

b 

inbred 

BIO . BR 

+ 

+ 

dom 

k 

inbred 

BIO. PL 

+ 

+ 

dom 

u 

inbred 

B10.WB 

+ 

+ 

dom 

J 

inbred 

NON 

+ 

+ 

dom 

non 

inbred 

JER1 

+ 

+ 

dom 

Mudol6 

Israel 

JER4 

+ 

+ 

dom 

Mudol2 

Israel 

AZROU2 

+ 

+ 

dom 

Mudol7 

Morocco 

AZROU3 

+ 

+ 

dom 

Mudo7 

Morocco 

MET1 

+ 

+ 

dom 

Mudol5 

Yugoslavia 

FAYIUM4 

+ 

+ 

dom 

MudolO 

Egypt 

ERFOUD1 

+ 

+ 

dom 

Mudo9 

Morocco 

ERFOUD5 

+ 

+ 

dom 

Mudo6 

Morocco 

B10.CAA2 

+ 

+ 

dom 

Mudo2 

Michigan 

BUA16 

+ 

+ 

dom 

Mudo6 

Michigan 

B10.SAA48 

+ 

+ 

dom 

Mudo4 

Michigan 

B10.6R 

_ 

_ 

dom 

q 

inbred 

B10.M 

_ 

— 

dom 

f 

inbred 

C3H.NB 

— 

- 

dom 

p 

inbred 

A.SW 

_ 

- 

dom 

s 

inbred 

NOD 

_ 

— 

dom 

nod 

inbred 

MET2 

— 

— 

dom 

Mudo8 

Yugoslavia 

DR1 

_ 

— 

dom 

Mudol 

Florida 

BRNO  4 

_ 

— 

mus 

Mumu6 

Czech. 

VIBOURG7 

- 

- 

mus 

Mumul 

Denmark 

B10.RIII 

+ 

_ 

dom 

r 

inbred 

B10.SM 

+ 

— 

dom 

V 

inbred 
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Table  4-17 
Reactivity  of  Aa  Chain  Specific  Monoclonal  Antibodies 

STRAIN     MONOCLONAL  ANTIBODY 

3JP        39J        39F        K25-8.7    K24-199 


+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 

+  + 

+  + 

+  + 

+  + 

+  + 

+ 
+ 
+ 

+ 
+ 

+ 
+ 


C57/B6 

+ 

- 

- 

B10.6R 

+ 

- 

- 

BIO. PL 

+ 

- 

- 

C3H.NB 

+ 

- 

- 

B10.SM 

+ 

- 

- 

NON 

+ 

it'* 

- 

MET2 

+ 

,;'    _ 

- 

AZROU2 

+ 

- 

- 

AZROU3 

+ 

- 

- 

DR1 

+ 

- 

- 

ERFOUD1 

+ 

- 

- 

B10.CAA2 

+ 

- 

- 

B10.SAA48 

+ 

- 

- 

JER1 

+ 

- 

- 

BRN04 

+ 

- 

- 

VIB0URG7 

+ 

— 

— 

B10.M 

+ 

— 

— 

B10.WB 

+ 

- 

- 

JER4 

+ 

- 

- 

ERFOUD5 

+ 

- 

- 

BUA16 

+ 

- 

— 

B10.RIII 

+ 

+ 

— 

A.SW 

+ 

+ 

- 

MET1 

+ 

+ 

— 

BALB/C 

- 

- 

- 

NOD 

— 

— 

— 

BIO. BR 

+ 

+ 

+ 

FAYIUM4 

+ 

+ 

+ 
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Table  4-18 
Reactivity  of  A£  Chain  Specific  Monoclonal  Antibodies 


STRAIN 

MONOCLONAL 

ANTIBODY 

10.2.16 

25-9-17 

34-5- 

•3 

MKD6 

40F 

B10.RIII 

+ 

__ 

,  , 

+ 

._ 

ERF0UD5 

+ 

- 

- 

+ 

- 

NOD 

+ 

- 

- 

+ 

- 

BUA16 

+ 

- 

- 

+ 

- 

MET1 

+ 

— 

— 

+ 

— 

BIO. PL 

+ 

— 

— 

+ 

+ 

B10.M 

+ 

- 

+ 

+ 

NON 

+ 

_ 

- 

+ 

+ 

AZROU3 

+ 

— 

— 

+ 

+ 

A.SW 

+ 

— 

— 

— 

- 

B10.WB 

+ 

- 

- 

- 

- 

JER4 

+ 

— 

— 

— 

— 

BIO. BR 

+ 

— 

— 

— 

+ 

FAYIUM4 

+ 

- 

— 

— 

+ 

BALB/C 

- 

+ 

+ 

+ 

- 

B10.6R 

- 

+ 

+ 

+ 

- 

B10.SM 

- 

+ 

+ 

+ 

- 

C3H.NB 

- 

+ 

+ 

+ 

- 

MET2 

- 

+ 

+ 

+ 

- 

DR1 

- 

+ 

+ 

+ 

- 

B10.SAA48 

- 

+ 

+ 

+ 

- 

VIBOURG7 

- 

+ 

+ 

+ 

- 

BRN04 

— 

+ 

+ 

+ 

— 

B10.CAA2 

— 

+ 

+ 

— 

— 

C57/B6 

- 

+ 

+ 

- 

- 

AZROU2 

— 

+ 

+ 

— 

— 

JER1 

- 

+ 

+ 

+ 

+ 

ERFOUD1 

+ 

+ 

+ 

+ 

+ 
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same  reactivity  patterns.   And  these  two  monoclonal 
antibodies,  with  the  exclusion  of  the  strain  ERF0UD1,  have 
an  antithetical  relationship  with  the  monoclonal  antibody 
10.2.16.   The  grouping  of  Ab  alleles  dependent  upon  these 
monoclonal  antibody  reactivities  illustrates  that  these 
antibodies  recognize  the  different  alleles  of  the  a-helix 
(Figure  4-14).   The  10.2.16+,  25-9-17",  34-5-3~alleles  all 
have  the  amino  acid  deletions  at  positions  65  and  67.  In 
direct  contrast,  alleles  that  are  10.2.16",  25-9-17+,  and  34- 
5-3+  are  all  undeleted,  and  have  a  proline  at  position  65. 
Although  it  is  impossible  to  definitively  map  where  on  the 
a-helix  these  monoclonal  antibodies  bind,  this  data 
illustrates  that  these  monoclonal  antibodies  can  distinguish 
between  two  very  different  forms  of  the  a-helix. 

This  data  is  consistent  with  that  of  Figueroa  et  al. 
(1988)  in  comparing  the  8  alleles  in  common  with  the  two 
analyses.   Alleles  that  are  10.2.16+,  25-9-17",  and  34-5-3~ 
are  H-2A.m27  and  H-2A.m25~.  Conversely  the  reciprocal  is 
observed,  alleles  that  are  10.2 .  16",  25-9-17+,  and  34-5-3+ 
are  H-2A.m25+  and  H-2A.m27~. 

The  epitopes  for  the  other  two  Ab  chain  specific 
monoclonal  antibodies  do  not  correlate  with  any  structural 
features  encoded  within  Ab  exon  2.  Thus,  the  epitopes  for 
MKD6  and  4 OF  may  map  to  the  Ab2  domain,  but  this  is  highly 
unlikely  due  to  the  limited  phenotypic  variability  encoded 
by  this  domain.   Alternatively,  it  is  possible  the  epitopes 
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recognized  by  these  two  monoclonal  antibodies  are  dependent 
upon  Aa  chain  interaction,  as  opposed  to  strictly  Ab  encoded 
polymorphisms.   Finally  their  epitopes  may  have  been 
incorrectly  mapped  to  the  Ab  chain. 


CHAPTER  V 
DISCUSSION 


The  generation  of  Mhc  polymorphism  is  a  two  step 
process  involving  the  production  of  variant  alleles  by 
mutagenic  events  followed  by  subsequent  selection  and 
maintenance.   The  alleles  examined  in  this  study  have 
survived  this  process.   Additionally,  by  studying  alleles 
derived  from  multiple  rodent  species  a  clearer  understanding 
of  the  evolutionary  processes  involved  in  Mhc  gene 
diversification  and  selection  can  be  elucidated. 
Unfortunately,  what  cannot  be  ascertained  by  analyzing 
alleles  that  have  already  undergone  the  processes  of 
selection  and  maintenance,  is  the  spectrum  of  variants  upon 
which  selection  operates.   Nevertheless,  this  can  be 
inferred  by  studying  the  patterns  of  diversification. 

The  Genetic  Mechanisms  of  Mhc  Gene  Diversification 

The  results  of  this  dissertation  illustrate  that  variation 
is  introduced  into  single  alleles  predominantly  by  point 
mutations,  and,  to  a  lesser  extent,  codon  deletions.   Novel 
combinations  of  specific  antigen  binding  site  segments  are 
constantly  being  formed,  albeit  at  a  very  slow  rate,  by 
recombinational  mechanisms.   Thus,  the  extensive  diversity 
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of  Mhc  class  II  Ab  is  generated  by  multi-step  mutational 
processes,  incorporating  both  the  retention  of  ancestral 
polymorphisms  and  recombinational  events. 

The  extent  of  recombination  in  the  alleles  examined  in 
this  study  conclusively  illustrate  its  central  role  in  the 
diversification  process  over  evolutionary  time  spans.   The 
exact  recombinational  mechanism  employed  was  not  addressed 
by  these  experiments;  although,  the  analysis  does  indicate 
that  site  specific  hyper-recombinational  mechanisms  are  not 
involved  in  this  process.   Either  homologous  recombination 
or  segmental  exchange  could  be  equally  responsible  for  the 
patterns  of  diversity  observed.   Analysis  of  DNA  sequences 
flanking  Ab  exon  2,  such  as  Ab  exon  3,  and  DNA  segments  in 
Ab  introns  1  and  2,  indicate  that  in  contrast  to  Ab  exon  2, 
the  flanking  sequence  polymorphisms  correlate  with  each 
other  and  the  retroposon  polymorphism  in  intron  2  of  Ab 
(J.X.  She  unpublished  observations) .   This  strongly  suggests 
that  segmental  exchange  events  specific  for  Ab  exon  2  are 
responsible  for  the  shuffling  of  allelic  antigen  binding 
site  segments. 

Combinatorial  Association  of  the  Aa  and  Ab  Chains 

The  extensive  amino  acid  diversity  of  Ab  exon  2 
combined  with  the  extent  of  recombination  observed  among  the 
alleles  in  the  data  set,  make  the  hypothesis  of  co-evolution 
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of  the  Aa  and  Ab  chains  highly  unlikely.   This  is  even  more 
apparent  when  the  diversity  of  the  protein  segments  thought 
to  be  responsible  for  chain  association  are  considered.   For 
instance,  there  are  23  different  forms  of  Ab  0-pleated  sheet 
1  among  the  alleles  in  the  data  set.   An  attempt  to 
correlate  the  allelic  Ab  BS1  forms  with  BS1  forms  of  the  Aa 
published  sequences  (Jones  et  al.  1990)  reveals  no 
associations.   Furthermore,  there  is  very  strong 
diversifying  selection  acting  on  the  Ab  0-pleated  sheet  1 
antigen  binding  site  segment  (dyd,  =  9.7). 

By  examination  of  the  hypothetical  3-dimensional  model 
of  the  Mhc  class  II  antigen  binding  site  (Brown  et  al. 
1988) ,  it  is  tempting  to  speculate  a  class  II  gene  specific 
interaction  between  the  3'  segment  of  the  Ab  a-helix  and  the 
51  segment  of  the  Aa  a-helix.   Both  regions  have  an 
undefined  tertiary  structure  based  on  the  class  I  model. 
However,  there  is  also  no  correlation  to  be  made  between  Ab 
and  Aa  protein  sequence  polymorphisms  in  this  region. 
Additionally,  there  is  also  strong  diversifying  selection 
acting  on  this  Ab  segment  (dn/d,  =  12.5). 

These  observations  suggest  that  Mhc  class  II  A  molecule 
subunit  association  is  a  more  complex  process  than  recent 
experimental  evidence  outlined,  and  perhaps  infers  a  more 
central  role  for  the  invariant  chain  (Ii)  in  subunit 
pairing.   For  instance,  in  pairs  of  chains  that  have  a  high 
efficiency  of  association  the  role  of  the  invariant  chain  to 
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stabilize  the  resultant  molecule  may  be  minimal.   However, 
in  cases  of  lower  chain  association  efficiency,  possibly  due 
to  nonsynonymous  point  mutations  of  critical  chain  pairing 
residues  or  intra-exonic  recombination,  the  role  of  the 
invariant  chain  to  pair  subunits  may  become  more  critical  to 
subsequent  cell  surface  expression.   This  scheme  does  raise 
the  question  of  the  number  of  class  II  A  molecules  that  may 
be  expressed  on  the  cell  surface  of  heterozygotes  (either  2, 
3,  or  4  molecules).   The  answer  may  lie  with  each  specific 
heterozygote  combination  and  the  efficiency  of  each 
individual  chain  to  associate  with  others,  and  properly  form 
a  functional  molecule.   This  model  presumes  that  all  A 
molecule  subunit  associations  would  occur  more  freely  than 
inter-isotypic  pairing  of  class  II  molecule  subunits. 

Mhc  Influence  on  Immune  Responsiveness 

The  physiological  role  of  Mhc  molecules  is  to  bind 
fragments  of  processed  antigen,  thereby  making  them 
recognizable  to  T  lymphocytes  (Shown  in  Figure  5-1) .   Mhc 
class  II  molecules  can  influence  immune  responsiveness  in  3 
important  ways.   First,  class  II  molecules  must  be  able  to 
bind  fragments  of  foreign  antigen,  to  make  them  recognizable 
to  regulatory  T  lymphocytes.   This  interaction  will 
subsequently  initiate  a  specific  immune  response  toward  the 
foreign  antigen.   Mhc  class  II  molecules  are  also  the  focal 
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point  of  regulatory  T  lymphocyte  antigen  receptor  repertoire 
selection  events  in  the  thymus.   The  T  lymphocyte  antigen 
receptor  must  be  able  to  interact  in  some  manner  with  a 
class  II  molecule  in  order  to  recognize  bound  antigenic 
fragments.   However,  both  the  T  cell  receptor  binding 
regions  and  Mhc  class  II  molecules  are  extremely  variant 
within  a  species,  and  are  genetically  unlinked  loci.   This 
necessitates  the  individual  selection  of  T  lymphocytes 
expressing  antigen  receptors  that  can  interact  with  one's 
class  II  molecules  (positive  selection),  but  will  not  react 
with  one's  class  II  molecule  bound  to  fragments  of  self- 
antigen  (negative  selection) (Reviewed  by  Schwartz  1989)  . 
Thus,  the  second  way  Mhc  class  II  molecules  can  influence 
immune  responsiveness  is  through  positive  selection  of  the  T 
lymphocyte  antigen  receptor  repertoire.   And  the  third 
immune  response  influencing  mechanism  is  the  presentation  of 
self-antigens  leading  to  negative  selection  of  the  T  cell 
repertoire.   Therefore,  residues  that  determine  antigen 
binding  (both  self  and  non-self) ,  and  T  lymphocyte  contact 
residues  on  the  class  II  molecule  (presumably  on  the  a- 
helix)  can  powerfully  shape  an  individuals  immune  response 
characteristics . 
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The  Selective  Maintenance  of  Antigen  Binding  Site  Diversity 

A  variety  of  data  indicates  that  Mhc  polymorphism  is 
maintained  by  some  type  of  balancing  selection;  although, 
the  precise  mechanisms  involved  have  remained  controversial. 
Overdominance  (heterozygote  advantage)  and  frequency- 
dependent  selection  (rare  allele  advantage)  have  generally 
been  thought  to  be  the  most  likely  candidates  of  balancing 
selection  (Bodmer  1972;  Hughes  and  Nei  1988). 

The  maintenance  of  Mhc  polymorphisms  by  overdominance 
is  based  on  the  experimental  observation  that  Mhc-linked 
immune  responsiveness  is  a  dominant  or  co-dominant  genetic 
trait  (Benacerraf  and  Germain  1978) .   Mhc  heterozygotes  are 
capable  of  responding  to  any  of  the  antigens  recognized  by 
either  parental  haplotype,  because  Mhc  molecules  encoded  by 
both  Mhc  haplotypes  are  expressed  on  the  surface  of  antigen 
presenting  cells.   Therefore,  immune  response  overdominance 
proposes  that  the  ability  of  Mhc  heterozygotes  to  bind  and 
present  a  wider  range  of  foreign  antigens  will  enhance  their 
ability  to  resist  infectious  diseases,  thereby  increasing 
their  relative  fitness  in  a  population.   The  ability  of 
overdominant  selection  to  maintain  extensive  polymorphisms 
has  been  established  by  both  theoretical  treatments  and 
experimental  observations  (Nei  1987) . 

The  hypothesis  of  rare  allele  advantage  is  based  on  the 
idea  that  endemic  pathogens,  which  evolve  much  more  rapidly 
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than  their  endemic  hosts,  will  tend  to  adapt  their 
antigenicity  to  minimize  immune  recognition  by  the  most 
prevalent  Mhc  genotypes  in  a  population.   Consequently,  new 
or  rare  Mhc  alleles,  whose  immune  response  properties  differ 
from  the  common  alleles,  will  have  a  possible  selective 
advantage  due  to  increased  resistance  to  the  prevalent 
pathogens.   This  type  of  selection  predicts  cyclic 
fluctuations  in  the  frequencies  of  Mhc  alleles  as  pathogens 
are  driven  to  evolve  their  antigenicity  toward  evading  the 
immune  response  characteristics  of  prevalent  Mhc  alleles. 
Thus,  rare  allele  advantage  leads  to  the  maintenance  of  Mhc 
polymorphisms  by  rescuing  rare  alleles  from  extinction. 
There  is  no  empirical  evidence;  however,  to  indicate  that 
this  mode  of  selection  operates  to  maintain  Mhc 
polymorphism . 

Divergent  Allele  Advantage 

The  extensive  structural  diversity  of  the  antigen 
binding  site  of  the  Mhc  class  II  Ab  gene  indicates  that 
another  selective  mechanism  must  be  operating  to 
specifically  maintain  the  vast  array  of  highly  divergent 
alleles.   This  specific  type  of  selection,  termed  divergent 
allele  advantage,  may  act  in  concert  with  the  other  forms  of 
balancing  selection  commonly  thought  to  operate  on  Mhc  genes 
(Bodmer  1972;  Zinkernagel  and  Dougherty  1974).   All  three 
modes  of  selection  would  contribute  to  the  maintenance  of 
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Mhc  polymorphisms,  but  divergent  allele  advantage  would 
preferentially  favor  the  maintenance  of  highly  divergent 
alleles  within  populations. 

The  results  of  this  dissertation  clearly  illustrate 
that  the  various  Ab  allelic  antigen  binding  sites  have  very 
divergent  structures.   These  structural  polymorphisms 
strongly  impact  on  the  functional  properties  of  Mhc  class  II 
molecules,  and  subsequently  impact  on  immune  responsiveness 
to  foreign  antigen. 

Immune  Response  Variation  Between  Divergent  Mhc  Alleles 

Each  Mhc  class  II  allele  has  the  capacity  to  bind  and 
present  a  varied  yet  specific  set  of  peptides  derived  from 
processed  antigens.   The  inability  of  a  specific  class  II 
allele  to  bind  and  present  a  fragment  from  processed  antigen 
results  in  the  loss  of  immune  responsiveness  for  that 
antigen  in  individuals  homozygous  for  that  Mhc  allele  (Buus 
et  al.  1987;  Guillet  et  al.  1987).   These  Mhc-linked  defects 
in  immune  responsiveness,  termed  Ir  gene  defects,  either 
reflect  the  spectrum  of  peptides  that  are  not  bound  by 
products  of  specific  Mhc  genotypes,  or  reflect  the 
functional  deletion  of  certain  T  lymphocytes  with  particular 
antigen  receptors  during  T  lymphocyte  ontogeny.   If  all  the 
Ir  gene  defects  in  a  species  were  combined,  they  would 
represent  the  immune  response  void  for  that  species 
(diagrammed  in  Figure  5-2) .   Endemic  pathogens  of  that 
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species  would  tend  to  evolve  antigenicity  focussed  within 
this  void.   Each  Mhc  allele  would  be  capable  of  initiating 
immune  responses  against  a  specific  subset  of  the 
antigenicity  of  the  void.   This  concept  is  illustrated  for  2 
lineages  of  Mhc  alleles  in  Figure  5-2.   Mhc  alleles  with 
highly  divergent  forms  of  the  antigen  binding  site  would 
cover  relatively  unique  sections  of  the  void  (ie.  separate 
patterns  around  LI  (lineage  1  alleles)  and  L3  (lineage  3 
alleles)  in  Figure  5-2) .   Alleles  with  similar  antigen 
binding  sites  and,  therefore,  presumably  binding  properties, 
would  cover  overlapping  segments  of  the  immune  response  void 
(circles  surrounding  either  LI  or  L3  in  Figure  5-2) . 
Selection  will  consequently  favor  the  maintenance  of  alleles 
with  highly  divergent  antigen  binding  sites  because  such 
alleles  will  detect  non-overlapping  subsets  of  pathogen 
antigenicity.   Mhc  class  II  molecules  with  antigen  binding 
sites  altered  by  powerful  single-step  mutagenic  events,  such 
as  intra-exonic  recombination  between  alleles  composed  of 
highly  divergent  antigen  binding  site  segments,  and  codon 
deletions  would  be  favored  by  selection.   This  is  because 
such  novel  alleles  would  detect  segments  of  the  immune 
response  void  that  do  not  overlap  with  those  segments  of  the 
void  already  covered  by  existing  Mhc  alleles. 

Divergent  allele  advantage  could  interact 
synergistically  with  overdominance,  resulting  in  the 
preferential  maintenance  of  alleles  with  very  different 
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antigen  binding  properties.   Since  immune  response 
overdominance  is  based  on  the  enhanced  capacity  of  Mhc 
heterozygotes  to  respond  to  a  greater  spectrum  of  foreign 
antigens,  heterozygotes  with  alleles  derived  from  different 
lineages  (L1/L3  as  opposed  to  Ll/Ll1) could  present  a  broader 
array  of  antigenic  peptides  than  two  alleles  derived  form  LI 
or  L3.   Consequently,  L1/L3  heterozygotes  would  be  favored 
over  either  Ll/Ll'  or  L3/L3 '  heterozygotes  and  homozygotes. 

Divergent  allele  advantage  can  interact  with  rare 
allele  advantage  to  maintain  Mhc  polymorphism  by 
preferentially  favoring  rare  alleles  with  unique  binding 
properties.   For  example,  consider  a  population  in  which  LI 
alleles  are  prevalent,  the  endemic  pathogens  will  evolve 
antigenicity  that  allows  them  to  take  advantage  of  the  Ir 
gene  defects  inherent  in  alleles  of  this  lineage.   As  a 
result,  alleles  in  L3  would  be  more  likely  to  have  antigen 
binding  sites  with  properties  capable  of  presenting  newly 
evolved  antigens  that  exploit  LI  Ir  defects  than  another 
antigen  binding  site  structurally  related  LI  allele.   Thus, 
divergent  allele  advantage  could  interact  with  rare  allele 
advantage  to  preferentially  maintain  alleles  with  highly 
divergent  antigen  binding  traits. 

Disease  resistance  is  a  multigenic  trait,  and 
consequently  the  model  described  above  represents  a 
simplification  of  host-pathogen  interaction.   In  addition, 
the  Mhc  is  a  tightly-linked  multigenic  family  rather  than  a 
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single  gene.   This  would  tend  to  weaken  the  strength  of 
selection  at  any  single  locus.   Nevertheless,  the  same  model 
would  hold  if  the  lineages  of  alleles  depicted  in  Figure  5-2 
were  replaced  with  Mhc  haplotypes. 
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